Nucleic acid-guided nucleases

ABSTRACT

Disclosed herein are nucleic acid-guided nucleases, guide nucleic acids, and targetable nuclease systems, and methods of use. Disclosed herein are engineered non-naturally occurring nucleic acid-guided nucleases, guide nucleic acids, and targetable nuclease systems, and methods of use. Targetable nuclease systems can be used to edit genetic targets, including recursive genetic engineering and trackable genetic engineering methods.

CROSS REFERENCE

This application is a continuation application of U.S. patent application Ser. No. 15/632,001, filed Jun. 23, 2017, which is incorporated herein by reference in its entirety.

BACKGROUND OF THE DISCLOSURE

Nucleic acid-guided nucleases have become important tools for research and genome engineering. The applicability of these tools can be limited by sequence specificity requirements, expression, or delivery issues.

SEQUENCE LISTING

The instant application contains a Sequence Listing which has been submitted electronically in ASCII format and is hereby incorporated by reference in its entirety. Said ASCII copy, created on Jul. 14, 2017, is named 49022-717_201_SL.txt and is 809,754 bytes in size. This application contains a partial sequence list in Table 6.

SUMMARY OF THE DISCLOSURE

Disclosed herein are methods of modifying a target region in the genome of a cell, the method comprising: (a) contacting a cell with: a non-naturally occurring nucleic acid-guided nuclease encoded by a nucleic acid having at least 80% identity to SEQ ID NO: 27; an engineered guide nucleic acid capable of complexing with the nucleic acid-guided nuclease; and an editing sequence encoding a nucleic acid complementary to said target region having a change in sequence relative to the target region; and (b) allowing the nuclease, guide nucleic acid, and editing sequence to create a genome edit in a target region of the genome of the cell. In some aspects, the engineered guide nucleic acid and the editing sequence are provided as a single nucleic acid. In some aspects, the single nucleic acid further comprises a mutation in a protospacer adjacent motif (PAM) site. In some aspects, the nucleic acid-guided nuclease is encoded by a nucleic acid with at least 85% identity to SEQ ID NO: 47. In some aspects, the nucleic acid-guided nuclease is encoded by a nucleic acid with at least 85% identity to SEQ ID NO: 133.

Disclosed herein are nucleic acid-guided nuclease systems comprising: (a) a non-naturally occurring nuclease encoded by a nucleic acid having at least 80% identity to SEQ ID NO: 27; (b) an engineered guide nucleic acid capable of complexing with the nucleic acid-guided nuclease; and (c) an editing sequence having a change in sequence relative to the sequence of a target region in a genome of a cell; wherein the system results in a genome edit in the target region in the genome of the cell facilitated by the nuclease, the engineered guide nucleic acid, and the editing sequence. In some aspects, the nucleic acid-guided nuclease is encoded by a nucleic acid with at least 85% identity to SEQ ID NO: 47. In some aspects, the nucleic acid-guided nuclease is encoded by a nucleic acid with at least 85% identity to SEQ ID NO: 133. In some aspects, the nucleic acid-guided nuclease is codon optimized for the cell to be edited. In some aspects, the engineered guide nucleic acid and the editing sequence are provided as a single nucleic acid. In some aspects, the single nucleic acid further comprises a mutation in a protospacer adjacent motif (PAM) site.

Disclosed herein are compositions for use in genome editing comprising a non-naturally occurring nuclease encoded by a nucleic acid having at least 75% identity to SEQ ID NO: 27. In some aspects, the nucleic acid has at least 80% identity to SEQ ID NO: 27. In some aspects, the nucleic acid has at least 90% identity to SEQ ID NO: 27. In some aspects, the nuclease is further codon optimized for use in cells from a particular organism. In some aspects, the nuclease is codon optimized for E. coli. In some aspects, the nuclease is codon optimized for S. cerevisiae. In some aspects, the nuclease is codon optimized for mammalian cells. In some aspects, the nucleic acid-guided nuclease has less than 40% protein identity to SEQ ID NO: 12. In some aspects, the nucleic acid-guided nuclease has less than 40% protein identity to SEQ ID NO: 108.

INCORPORATION BY REFERENCE

All publications and patent applications mentioned in this specification are herein incorporated by reference to the same extent as if each individual publication or patent application was specifically and individually indicated to be incorporated by reference.

BRIEF DESCRIPTION OF THE DRAWINGS

This patent or application file contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawing(s) will be provided by the Office upon request and payment of the necessary fee.

FIG. 1A depicts a partial sequence alignment of MAD1-8 (SEQ ID NO: 1-8) and MAD10-12 (SEQ ID NO: 10-12). FIG. 1A discloses residues 703-707 of SEQ ID NO: 1, residues 625-629 of SEQ ID NO: 2, residues 587-591 of SEQ ID NO: 3, residues 654-658 of SEQ ID NO: 4, residues 581-585 of SEQ ID NO: 5, residues 637-641 of SEQ ID NO: 6, residues 590-594 of SEQ ID NO: 7, residues 645-649 of SEQ ID NO: 8, SEQ ID NO: 205, residues 619-623 of SEQ ID NO: 10 and residues 603-607 of SEQ ID NO: 12, all disclosed respectively, in order of appearance.

FIG. 1B depicts a phylogenetic tree of nucleases including MAD1-8.

FIG. 2 depicts an example protein expression construct. FIG. 2 discloses “6×-His” as SEQ ID NO: 182.

FIG. 3 depicts an example editing cassette. FIG. 3 discloses SEQ ID NOS 183-185, respectively, in order of appearance.

FIG. 4 depicts an example screening or selection experiment workflow.

FIG. 5A depicts an example protein expression construct.

FIG. 5B depicts an example editing cassette.

FIG. 5C depicts an example screening or selection experiment workflow.

FIG. 6A depicts an example protein expression construct.

FIG. 6B depicts an example editing cassette.

FIG. 6C depicts an example screening or selection experiment workflow.

FIGS. 7A-7B depict example data from a functional nuclease complex screening or selection experiment.

FIG. 8 depicts example data from a targetable nuclease complex-based editing experiment.

FIG. 9 depicts example data from a targetable nuclease complex-based editing experiment.

FIGS. 10A-10C depict example data from a targetable nuclease complex-based editing experiment.

FIG. 11 depicts an example sequence alignment of select sequences from an editing experiment. FIG. 11 discloses SEQ ID NOS 186-188, 187, 187, 187, 187, 187, 189, 186, 187 and 187, respectively, in order of appearance.

FIG. 12 depicts example data from a targetable nuclease complex-based editing experiment.

FIG. 13A depicts an example alignment of scaffold sequences (SEQ ID NOS 190-202, respectively, in order of appearance).

FIG. 13B depicts an example model of a nucleic acid-guided nuclease complexed with a guide nucleic acid and a target sequence.

FIGS. 14A-14B depict example data from a primer validation experiment.

FIG. 15 depicts example data from a targetable nuclease complex-based editing experiment.

FIG. 16 depicts example validation data comparing results from two different assays.

FIGS. 17A-17C depict an example trackable genetic engineering workflow, including a plasmid comprising an editing cassette and a recording cassette, and downstream sequencing of barcodes in order to identify the incorporated edit or mutation. FIG. 17B discloses SEQ ID NOS 203 and 204, respectively, in order of appearance.

FIG. 18 depicts an example trackable genetic engineering workflow, including iterative rounds of engineering with a different editing cassette and recorder cassette with unique barcode (BC) at each round, which can be followed by selection and tracking to confirm the successful engineering step at each round.

FIG. 19 depicts an example recursive engineering workflow.

DETAILED DESCRIPTION OF THE DISCLOSURE

The present disclosure provides nucleic acid-guided nucleases and methods of use. Often, the subject nucleic acid-guided nucleases are part of a targetable nuclease system comprising a nucleic acid-guided nuclease and a guide nucleic acid. A subject targetable nuclease system can be used to cleave, modify, and/or edit a target polynucleotide sequence, often referred to as a target sequence. A subject targetable nuclease system refers collectively to transcripts and other elements involved in the expression of or directing the activity of genes, which may include sequences encoding a subject nucleic acid-guided nuclease protein and a guide nucleic acid as disclosed herein.

Methods, systems, vectors, polynucleotides, and compositions described herein may be used in various applications including altering or modifying synthesis of a gene product, such as a protein; polynucleotide cleavage; polynucleotide editing; polynucleotide splicing; trafficking of target polynucleotide; tracing of target polynucleotide; isolation of target polynucleotide; visualization of target polynucleotide; etc. Aspects of the invention also encompass methods and uses of the compositions and systems described herein in genome engineering, e.g., for altering or manipulating the expression of one or more genes or the one or more gene products, in prokaryotic, archaeal, or eukaryotic cells, in vitro, in vivo, or ex vivo.

Nucleic Acid-Guided Nucleases

Bacterial and archaeal targetable nuclease systems have emerged as powerful tools for precision genome editing. However, naturally occurring nucleases have some limitations, including expression and delivery challenges due to the nucleic acid sequence and protein size. Targetable nucleases that require PAM recognition are also limited in the sequences they can target throughout a genetic sequence. Other challenges include processivity, target recognition specificity and efficiency, and nuclease activity efficiency, which often affect genetic editing efficiency.
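
The PAM constraint noted above can be illustrated with a minimal Python sketch that scans one strand of a sequence for candidate PAM sites. The function names and the example motifs ("TTTN" and "NGG") are assumptions chosen for illustration only; the disclosure does not specify these motifs, and a practical tool would also scan the reverse strand.

```python
import re

# IUPAC nucleotide codes expanded to regular-expression character classes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "S": "[CG]", "W": "[AT]",
    "K": "[GT]", "M": "[AC]", "B": "[CGT]", "D": "[AGT]",
    "H": "[ACT]", "V": "[ACG]", "N": "[ACGT]",
}


def find_pam_sites(sequence, pam):
    """Return 0-based start positions of every PAM match on the given strand."""
    pattern = "".join(IUPAC[base] for base in pam.upper())
    # A zero-width lookahead reports overlapping matches as well.
    return [m.start() for m in re.finditer("(?=" + pattern + ")", sequence.upper())]


def targetable_fraction(sequence, pam):
    """Fraction of PAM-length windows on this strand that match the PAM."""
    n_windows = len(sequence) - len(pam) + 1
    if n_windows <= 0:
        return 0.0
    return len(find_pam_sites(sequence, pam)) / n_windows


if __name__ == "__main__":
    demo = "ATTTAGGTTTCCC"  # hypothetical sequence, for illustration only
    print(find_pam_sites(demo, "TTTN"))
    print(find_pam_sites(demo, "NGG"))
```

A more permissive motif matches more windows, which is one way to read the statement that broadening the range of recognized PAMs widens the set of targetable sequences.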

Non-naturally occurring targetable nucleases and non-naturally occurring targetable nuclease systems can address many of these challenges and limitations.

Disclosed herein are non-naturally occurring targetable nuclease systems. Such targetable nuclease systems are engineered to address one or more of the challenges described above and can be referred to as engineered nuclease systems. Engineered nuclease systems can comprise one or more of an engineered nuclease, such as an engineered nucleic acid-guided nuclease, an engineered guide nucleic acid, an engineered polynucleotide encoding said nuclease, or an engineered polynucleotide encoding said guide nucleic acid. Engineered nucleases, engineered guide nucleic acids, and engineered polynucleotides encoding the engineered nuclease or engineered guide nucleic acid are not naturally occurring and are not found in nature. It follows that engineered nuclease systems including one or more of these elements are non-naturally occurring.

Non-limiting examples of types of engineering that can be done to obtain a non-naturally occurring nuclease system are as follows. Engineering can include codon optimization to facilitate expression or improve expression in a host cell, such as a heterologous host cell. Engineering can reduce the size or molecular weight of the nuclease in order to facilitate expression or delivery. Engineering can alter PAM selection in order to change PAM specificity or to broaden the range of recognized PAMs. Engineering can alter, increase, or decrease stability, processivity, specificity, or efficiency of a targetable nuclease system. Engineering can alter, increase, or decrease protein stability. Engineering can alter, increase, or decrease processivity of nucleic acid scanning. Engineering can alter, increase, or decrease target sequence specificity. Engineering can alter, increase, or decrease nuclease activity. Engineering can alter, increase, or decrease editing efficiency. Engineering can alter, increase, or decrease transformation efficiency. Engineering can alter, increase, or decrease nuclease or guide nucleic acid expression.

Examples of non-naturally occurring nucleic acid sequences which are disclosed herein include sequences codon optimized for expression in bacteria, such as E. coli (e.g., SEQ ID NO: 41-60), sequences codon optimized for expression in single cell eukaryotes, such as yeast (e.g., SEQ ID NO: 127-146), sequences codon optimized for expression in multicellular eukaryotes, such as human cells (e.g., SEQ ID NO: 147-166), polynucleotides used for cloning or expression of any sequences disclosed herein (e.g., SEQ ID NO: 61-80), plasmids comprising nucleic acid sequences (e.g., SEQ ID NO: 21-40) operably linked to a heterologous promoter or nuclear localization signal or other heterologous element, proteins generated from engineered or codon optimized nucleic acid sequences (e.g., SEQ ID NO: 1-20), or engineered guide nucleic acids comprising any one of SEQ ID NO: 84-107. Such non-naturally occurring nucleic acid sequences can be amplified, cloned, assembled, synthesized, generated from synthesized oligonucleotides or dNTPs, or otherwise obtained using methods known by those skilled in the art.

Disclosed herein are nucleic acid-guided nucleases. Subject nucleases are functional in vitro, or in prokaryotic, archaeal, or eukaryotic cells for in vitro, in vivo, or ex vivo applications. Suitable nucleic acid-guided nucleases can be from an organism from a genus which includes but is not limited to Thiomicrospira, Succinivibrio, Candidatus, Porphyromonas, Acidaminococcus, Acidomonococcus, Prevotella, Smithella, Moraxella, Synergistes, Francisella, Leptospira, Catenibacterium, Kandleria, Clostridium, Dorea, Coprococcus, Enterococcus, Fructobacillus, Weissella, Pediococcus, Corynebacter, Sutterella, Legionella, Treponema, Roseburia, Filifactor, Eubacterium, Streptococcus, Lactobacillus, Mycoplasma, Bacteroides, Flaviivola, Flavobacterium, Sphaerochaeta, Azospirillum, Gluconacetobacter, Neisseria, Roseburia, Parvibaculum, Staphylococcus, Nitratifractor, Mycoplasma, Alicyclobacillus, Brevibacilus, Bacillus, Bacteroidetes, Brevibacilus, Carnobacterium, Clostridiaridium, Clostridium, Desulfonatronum, Desulfovibrio, Helcococcus, Leptotrichia, Listeria, Methanomethyophilus, Methylobacterium, Opitutaceae, Paludibacter, Rhodobacter, Sphaerochaeta, Tuberibacillus, Oleiphilus, Omnitrophica, Parcubacteria, and Campylobacter. Species of organism of such a genus can be as otherwise herein discussed. Suitable nucleic acid-guided nucleases can be from an organism from a genus or unclassified genus within a kingdom which includes but is not limited to Firmicute, Actinobacteria, Bacteroidetes, Proteobacteria, Spirochaetes, and Tenericutes. Suitable nucleic acid-guided nucleases can be from an organism from a genus or unclassified genus within a phylum which includes but is not limited to Erysipelotrichia, Clostridia, Bacilli, Actinobacteria, Bacteroidetes, Flavobacteria, Alphaproteobacteria, Betaproteobacteria, Gammaproteobacteria, Deltaproteobacteria, Epsilonproteobacteria, Spirochaetes, and Mollicutes. Suitable nucleic acid-guided nucleases can be from an organism from a genus or unclassified genus within an order which includes but is not limited to Clostridiales, Lactobacillales, Actinomycetales, Bacteroidales, Flavobacteriales, Rhizobiales, Rhodospirillales, Burkholderiales, Neisseriales, Legionellales, Nautiliales, Campylobacterales, Spirochaetales, Mycoplasmatales, and Thiotrichales. Suitable nucleic acid-guided nucleases can be from an organism from a genus or unclassified genus within a family which includes but is not limited to Lachnospiraceae, Enterococcaceae, Leuconostocaceae, Lactobacillaceae, Streptococcaceae, Peptostreptococcaceae, Staphylococcaceae, Eubacteriaceae, Corynebacterineae, Bacteroidaceae, Flavobacterium, Cryomoorphaceae, Rhodobiaceae, Rhodospirillaceae, Acetobacteraceae, Sutterellaceae, Neisseriaceae, Legionellaceae, Nautiliaceae, Campylobacteraceae, Spirochaetaceae, Mycoplasmataceae, Pisciririckettsiaceae, and Francisellaceae. Other nucleic acid-guided nucleases have been described in US Patent Application Publication No. US20160208243 filed Dec. 18, 2015, US Application Publication No. US20140068797 filed Mar. 15, 2013, U.S. Pat. No. 8,697,359 filed Oct. 15, 2013, and Zetsche et al., Cell 2015 Oct. 22; 163(3):759-71, each of which is incorporated herein by reference in its entirety.

Some nucleic acid-guided nucleases suitable for use in the methods, systems, and compositions of the present disclosure include those derived from an organism such as, but not limited to, Thiomicrospira sp. XS5, Eubacterium rectale, Succinivibrio dextrinosolvens, Candidatus Methanoplasma termitum, Candidatus Methanomethylophilus alvus, Porphyromonas crevioricanis, Flavobacterium branchiophilum, Acidaminococcus sp., Acidomonococcus sp., Lachnospiraceae bacterium COE1, Prevotella brevis ATCC 19188, Smithella sp. SCADC, Moraxella bovoculi, Synergistes jonesii, Bacteroidetes oral taxon 274, Francisella tularensis, Leptospira inadai serovar Lyme str. 10, Acidomonococcus sp. crystal structure (5B43), S. mutans, S. agalactiae, S. equisimilis, S. sanguinis, S. pneumonia; C. jejuni, C. coli; N. salsuginis, N. tergarcus; S. auricularis, S. carnosus; N. meningitides, N. gonorrhoeae; L. monocytogenes, L. ivanovii; C. botulinum, C. difficile, C. tetani, C. sordellii; Francisella tularensis 1, Prevotella albensis, Lachnospiraceae bacterium MC2017 1, Butyrivibrio proteoclasticus, Butyrivibrio proteoclasticus B316, Peregrinibacteria bacterium GW2011_GWA2_33_10, Parcubacteria bacterium GW2011_GWC2_44_17, Smithella sp. SCADC, Acidaminococcus sp. BV3L6, Lachnospiraceae bacterium MA2020, Candidatus Methanoplasma termitum, Eubacterium eligens, Moraxella bovoculi 237, Leptospira inadai, Lachnospiraceae bacterium ND2006, Porphyromonas crevioricanis 3, Prevotella disiens, Porphyromonas macacae, Catenibacterium sp. CAG:290, Kandleria vitulina, Clostridiales bacterium KA00274, Lachnospiraceae bacterium 3-2, Dorea longicatena, Coprococcus catus GD/7, Enterococcus columbae DSM 7374, Fructobacillus sp. EFB-N1, Weissella halotolerans, Pediococcus acidilactici, Lactobacillus curvatus, Streptococcus pyogenes, Lactobacillus versmoldensis, Filifactor alocis ATCC 35896, Alicyclobacillus acidoterrestris, Alicyclobacillus acidoterrestris ATCC 49025, Desulfovibrio inopinatus, Desulfovibrio inopinatus DSM 10711, Oleiphilus sp., Oleiphilus sp. HI0009, Candidatus kefeldibacteria, Parcubacteria CasY.4, Omnitrophica WOR 2 bacterium GWF2, Bacillus sp. NSP2.1, and Bacillus thermoamylovorans.

In some instances, a nucleic acid-guided nuclease disclosed herein comprises an amino acid sequence comprising at least 50% amino acid identity to any one of SEQ ID NO: 1-20. In some instances, a nuclease comprises an amino acid sequence comprising at least about 10%, 20%, 30%, 40%, 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95%, or 100% amino acid identity to any one of SEQ ID NO: 1-20. In some cases, the nucleic acid-guided nuclease comprises at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, or greater than 95% amino acid identity to any one of SEQ ID NO: 1-20. In some cases, the nucleic acid-guided nuclease comprises at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, or greater than 95% amino acid identity to any one of SEQ ID NO: 1-8 or 10-12. In some cases, the nucleic acid-guided nuclease comprises at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, or greater than 95% amino acid identity to any one of SEQ ID NO: 1-8 or 10-11. In some cases, the nucleic acid-guided nuclease comprises at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, or greater than 95% amino acid identity to SEQ ID NO: 2. In some cases, the nucleic acid-guided nuclease comprises at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, or greater than 95% amino acid identity to SEQ ID NO: 7.

In some cases, the nucleic acid-guided nuclease comprises any one of SEQ ID NO: 1-20. In some cases, the nucleic acid-guided nuclease comprises any one of SEQ ID NO: 1-8 or 10-12. In some cases, the nucleic acid-guided nuclease comprises any one of SEQ ID NO: 1-8 or 10-11. In some cases, the nucleic acid-guided nuclease comprises SEQ ID NO: 2. In some cases, the nucleic acid-guided nuclease comprises SEQ ID NO: 7.

In some instances, a nucleic acid-guided nuclease comprises an amino acid sequence comprising at most 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, or 95% amino acid identity to any one of SEQ ID NO: 12 or SEQ ID NO: 108-110. In some instances, a nucleic acid-guided nuclease comprises an amino acid sequence comprising at most 50% amino acid identity to any one of SEQ ID NO: 12 or SEQ ID NO: 108-110. In some instances, a nucleic acid-guided nuclease comprises an amino acid sequence comprising at most 45% amino acid identity to any one of SEQ ID NO: 12 or SEQ ID NO: 108-110. In some instances, a nucleic acid-guided nuclease comprises an amino acid sequence comprising at most 40% amino acid identity to any one of SEQ ID NO: 12 or SEQ ID NO: 108-110. In some instances, a nucleic acid-guided nuclease comprises an amino acid sequence comprising at most 35% amino acid identity to any one of SEQ ID NO: 12 or SEQ ID NO: 108-110. In some instances, a nucleic acid-guided nuclease comprises an amino acid sequence comprising at most 30% amino acid identity to any one of SEQ ID NO: 12 or SEQ ID NO: 108-110.
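
The "at least" and "at most" identity thresholds recited above can be checked mechanically once two sequences have been aligned. The following Python sketch is illustrative only. It assumes the sequences are already aligned with "-" as the gap character, and it assumes one common convention for percent identity (identical residues divided by alignment columns, ignoring columns that are gaps in both sequences). The disclosure does not prescribe a particular alignment tool or identity formula, and the function names are hypothetical.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent identity between two pre-aligned sequences of equal length."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    # Ignore columns in which both sequences carry a gap.
    columns = [(x, y) for x, y in zip(aligned_a.upper(), aligned_b.upper())
               if not (x == "-" and y == "-")]
    if not columns:
        return 0.0
    matches = sum(1 for x, y in columns if x == y and x != "-")
    return 100.0 * matches / len(columns)


def meets_identity_bounds(aligned_candidate, aligned_reference,
                          at_least=None, at_most=None):
    """True if the identity satisfies the optional lower and upper bounds."""
    identity = percent_identity(aligned_candidate, aligned_reference)
    if at_least is not None and identity < at_least:
        return False
    if at_most is not None and identity > at_most:
        return False
    return True


if __name__ == "__main__":
    # Hypothetical toy peptides, not sequences from the Sequence Listing.
    print(percent_identity("MKTL", "MKTV"))
    print(meets_identity_bounds("MKTL", "MKTV", at_least=50))
    print(meets_identity_bounds("MKTL", "MKTV", at_most=40))
```

In practice the two bounds would typically be applied against different references, for example a lower bound against one of SEQ ID NO: 1-20 and an upper bound against SEQ ID NO: 12 or SEQ ID NO: 108-110.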

In some instances, a nucleic acid-guided nuclease disclosed herein is encoded by a nucleic acid sequence comprising at least 50% sequence identity to any one of SEQ ID NO: 21-40. In some instances, a nuclease is encoded by a nucleic acid sequence comprising at least about 10%, 20%, 30%, 40%, 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95%, or 100% sequence identity to any one of SEQ ID NO: 21-40. In some instances, a nuclease is encoded by a nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, or greater than 95% sequence identity to any one of SEQ ID NO: 21-40. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, or greater than 95% sequence identity to any one of SEQ ID NO: 21-40. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, or greater than 95% sequence identity to any one of SEQ ID NO: 21-28 or 30-32. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, or greater than 95% sequence identity to any one of SEQ ID NO: 21-28 or 30-31. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, or greater than 95% sequence identity to SEQ ID NO: 22. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, or greater than 95% sequence identity to SEQ ID NO: 27.

In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of any one of SEQ ID NO: 21-40. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of any one of SEQ ID NO: 21-28 or 30-32. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of any one of SEQ ID NO: 21-28 or 30-31. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of SEQ ID NO: 22. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of SEQ ID NO: 27.

In some instances, a nucleic acid-guided nuclease disclosed herein is encoded on a nucleic acid sequence. Such a nucleic acid can be codon optimized for expression in a desired host cell. Suitable host cells can include, as non-limiting examples, prokaryotic cells such as E. coli, P. aeruginosa, B. subtilis, and V. natriegens, and eukaryotic cells such as S. cerevisiae, plant cells, insect cells, nematode cells, amphibian cells, fish cells, or mammalian cells, including human cells.
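
As a rough illustration of what codon optimization for a host cell involves, the Python sketch below back-translates a protein sequence using one preferred codon per amino acid for each host. Everything in it is an assumption made for illustration: the host labels "host_a" and "host_b", the tiny codon tables (placeholders covering only four residues and a stop), and the one-codon-per-residue strategy. It is not the optimization procedure used to generate the codon optimized sequences of the Sequence Listing, and real workflows draw on full host codon usage tables and additional constraints.

```python
# Placeholder preferred-codon tables for two hypothetical hosts.
# Each codon is a valid standard-code codon for its amino acid, but the
# choice of "preferred" codon here is illustrative, not measured data.
PREFERRED_CODONS = {
    "host_a": {"M": "ATG", "K": "AAA", "L": "CTG", "S": "AGC", "*": "TAA"},
    "host_b": {"M": "ATG", "K": "AAG", "L": "TTG", "S": "TCT", "*": "TAA"},
}


def codon_optimize(protein, host):
    """Back-translate a protein using one preferred codon per residue."""
    table = PREFERRED_CODONS[host]
    codons = []
    for residue in protein.upper():
        if residue not in table:
            raise ValueError("no codon defined for residue " + repr(residue))
        codons.append(table[residue])
    return "".join(codons)


def translate(dna, host):
    """Translate DNA back to protein using the same table, as a sanity check."""
    reverse = {codon: aa for aa, codon in PREFERRED_CODONS[host].items()}
    return "".join(reverse[dna[i:i + 3]] for i in range(0, len(dna), 3))


if __name__ == "__main__":
    peptide = "MKLS*"  # hypothetical peptide, for illustration only
    for host in PREFERRED_CODONS:
        dna = codon_optimize(peptide, host)
        print(host, dna, translate(dna, host) == peptide)
```

The two outputs encode the same protein with different nucleotide sequences, which is why the codon optimized nucleic acids for different hosts can share a protein product while differing in nucleic acid sequence identity.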

A nucleic acid sequence encoding a nucleic acid-guided nuclease can be codon optimized for expression in gram positive bacteria, e.g., Bacillus subtilis, or gram negative bacteria, e.g., E. coli. In some instances, a nucleic acid-guided nuclease disclosed herein is encoded by a nucleic acid sequence comprising at least 50% sequence identity to any one of SEQ ID NO: 41-60. In some instances, a nuclease is encoded by a nucleic acid sequence comprising at least about 10%, 20%, 30%, 40%, 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95%, or 100% sequence identity to any one of SEQ ID NO: 41-60. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, or greater than 95% sequence identity to any one of SEQ ID NO: 41-60. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, or greater than 95% sequence identity to any one of SEQ ID NO: 41-48 or 50-52. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, or greater than 95% sequence identity to any one of SEQ ID NO: 41-48 or 50-51. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, or greater than 95% sequence identity to SEQ ID NO: 42. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, or greater than 95% sequence identity to SEQ ID NO: 47.

In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of any one of SEQ ID NO: 41-60. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of any one of SEQ ID NO: 41-48 or 50-52. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of any one of SEQ ID NO: 41-48 or 50-51. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of SEQ ID NO: 42. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of SEQ ID NO: 47.

A nucleic acid sequence encoding a nucleic acid-guided nuclease can be codon optimized for expression in a species of yeast, e.g., S. cerevisiae. In some instances, a nucleic acid-guided nuclease disclosed herein is encoded by a nucleic acid sequence comprising at least 50% sequence identity to any one of SEQ ID NO: 127-146. In some instances, a nuclease is encoded by a nucleic acid sequence comprising at least about 10%, 20%, 30%, 40%, 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95%, or 100% sequence identity to any one of SEQ ID NO: 127-146. In some instances, a nuclease is encoded by a nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, or greater than 95% sequence identity to any one of SEQ ID NO: 127-146. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, or greater than 95% sequence identity to any one of SEQ ID NO: 127-146. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, or greater than 95% sequence identity to any one of SEQ ID NO: 127-134 or 136-138. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, or greater than 95% sequence identity to any one of SEQ ID NO: 127-134 or 136-137. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, or greater than 95% sequence identity to SEQ ID NO: 128. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, or greater than 95% sequence identity to SEQ ID NO: 133.

In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of any one of SEQ ID NO: 127-146. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of any one of SEQ ID NO: 127-134 or 136-138. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of any one of SEQ ID NO: 127-134 or 136-137. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of SEQ ID NO: 128. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of SEQ ID NO: 133.

A nucleic acid sequence encoding a nucleic acid-guided nuclease can be codon optimized for expression in mammalian cells. In some instances, a nucleic acid-guided nuclease disclosed herein is encoded by a nucleic acid sequence comprising at least 50% sequence identity to any one of SEQ ID NO: 147-166. In some instances, a nuclease is encoded by a nucleic acid sequence comprising at least about 10%, 20%, 30%, 40%, 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95%, or 100% sequence identity to any one of SEQ ID NO: 147-166. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, or greater than 95% sequence identity to any one of SEQ ID NO: 147-166. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, or greater than 95% sequence identity to any one of SEQ ID NO: 147-154 or 156-158. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, or greater than 95% sequence identity to any one of SEQ ID NO: 147-154 or 156-157. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, or greater than 95% sequence identity to SEQ ID NO: 148. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, or greater than 95% sequence identity to SEQ ID NO: 153.

In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of any one of SEQ ID NO: 147-166. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of any one of SEQ ID NO: 147-154 or 156-158. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of any one of SEQ ID NO: 147-154 or 156-157. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of SEQ ID NO: 148. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of SEQ ID NO: 153.

A nucleic acid sequence encoding a nucleic acid-guided nuclease can be operably linked to a promoter. Such nucleic acid sequences can be linear or circular. The nucleic acid sequences can be comprised on a larger linear or circular nucleic acid sequence that comprises additional elements such as an origin of replication, selectable or screenable marker, terminator, other components of a targetable nuclease system, such as a guide nucleic acid, or an editing or recorder cassette as disclosed herein. These larger nucleic acid sequences can be recombinant expression vectors, as are described in more detail later.

Guide Nucleic Acid

In general, a guide nucleic acid can complex with a compatible nucleic acid-guided nuclease and can hybridize with a target sequence, thereby directing the nuclease to the target sequence. A subject nucleic acid-guided nuclease capable of complexing with a guide nucleic acid can be referred to as a nucleic acid-guided nuclease that is compatible with the guide nucleic acid. Likewise, a guide nucleic acid capable of complexing with a nucleic acid-guided nuclease can be referred to as a guide nucleic acid that is compatible with the nucleic acid-guided nuclease.

A guide nucleic acid can be DNA. A guide nucleic acid can be RNA. A guide nucleic acid can comprise both DNA and RNA. A guide nucleic acid can comprise modified or non-naturally occurring nucleotides. In cases where the guide nucleic acid comprises RNA, the RNA guide nucleic acid can be encoded by a DNA sequence on a polynucleotide molecule such as a plasmid, linear construct, or editing cassette as disclosed herein.

A guide nucleic acid can comprise a guide sequence. A guide sequence is a polynucleotide sequence having sufficient complementarity with a target polynucleotide sequence to hybridize with the target sequence and direct sequence-specific binding of a complexed nucleic acid-guided nuclease to the target sequence. The degree of complementarity between a guide sequence and its corresponding target sequence, when optimally aligned using a suitable alignment algorithm, is about or more than about 50%, 60%, 75%, 80%, 85%, 90%, 95%, 97.5%, 99%, or more. Optimal alignment may be determined with the use of any suitable algorithm for aligning sequences. In some embodiments, a guide sequence is about or more than about 5, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 35, 40, 45, 50, 75, or more nucleotides in length. In some embodiments, a guide sequence is less than about 75, 50, 45, 40, 35, 30, 25, or 20 nucleotides in length. Preferably the guide sequence is 10-30 nucleotides long. The guide sequence can be 15-20 nucleotides in length. The guide sequence can be 15 nucleotides in length. The guide sequence can be 16 nucleotides in length. The guide sequence can be 17 nucleotides in length. The guide sequence can be 18 nucleotides in length. The guide sequence can be 19 nucleotides in length. The guide sequence can be 20 nucleotides in length.
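As a non-limiting illustration, the degree of complementarity between a guide sequence and a target sequence can be estimated as sketched below. This sketch assumes an ungapped comparison of two sequences of equal length, which is a simplification of the optimal alignment referred to above. The function names are hypothetical, and both sequences are assumed to be written 5′ to 3′.

```python
# Illustrative sketch only. Function names are hypothetical; the ungapped,
# equal-length comparison is a simplifying assumption, not an optimal alignment.

COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def _as_dna(seq: str) -> str:
    """Upper-case a sequence and write RNA 'U' as 'T' so DNA and RNA compare alike."""
    return seq.upper().replace("U", "T")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a sequence written 5' to 3'."""
    return "".join(COMPLEMENT[base] for base in reversed(_as_dna(seq)))


def percent_complementarity(guide: str, target: str) -> float:
    """Percent of guide positions that base-pair with the target strand."""
    if len(guide) != len(target):
        raise ValueError("this sketch compares sequences of equal length only")
    if not guide:
        raise ValueError("sequences are empty")
    # A fully complementary guide equals the reverse complement of the target.
    expected = reverse_complement(target)
    paired = sum(g == e for g, e in zip(_as_dna(guide), expected))
    return 100.0 * paired / len(guide)
```

Under these assumptions, a 20-nucleotide guide sequence with a single mismatch to its target would have 95% complementarity.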

A guide nucleic acid can comprise a scaffold sequence. In general, a “scaffold sequence” includes any sequence that has sufficient sequence to promote formation of a targetable nuclease complex, wherein the targetable nuclease complex comprises a nucleic acid-guided nuclease and a guide nucleic acid comprising a scaffold sequence and a guide sequence. Sufficient sequence within the scaffold sequence to promote formation of a targetable nuclease complex may include a degree of complementarity along the length of two sequence regions within the scaffold sequence, such as one or two sequence regions involved in forming a secondary structure. In some cases, the one or two sequence regions are comprised or encoded on the same polynucleotide. In some cases, the one or two sequence regions are comprised or encoded on separate polynucleotides. Optimal alignment may be determined by any suitable alignment algorithm, and may further account for secondary structures, such as self-complementarity within either the one or two sequence regions. In some embodiments, the degree of complementarity between the one or two sequence regions along the length of the shorter of the two when optimally aligned is about or more than about 25%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, 97.5%, 99%, or higher. In some embodiments, at least one of the two sequence regions is about or more than about 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 25, 30, 40, 50, or more nucleotides in length.

A scaffold sequence of a subject guide nucleic acid can comprise a secondary structure. A secondary structure can comprise a pseudoknot region. In some cases, binding kinetics of a guide nucleic acid to a nucleic acid-guided nuclease are determined in part by secondary structures within the scaffold sequence. In some cases, binding kinetics of a guide nucleic acid to a nucleic acid-guided nuclease are determined in part by nucleic acid sequence within the scaffold sequence.

A scaffold sequence can comprise the sequence of any one of SEQ ID NO: 84-107. A scaffold sequence can comprise the sequence of any one of SEQ ID NO: 84-103. A scaffold sequence can comprise the sequence of any one of SEQ ID NO: 84-91 or 93-95. A scaffold sequence can comprise the sequence of any one of SEQ ID NO: 88, 93, 94, or 95. A scaffold sequence can comprise the sequence of SEQ ID NO: 88. A scaffold sequence can comprise the sequence of SEQ ID NO: 93. A scaffold sequence can comprise the sequence of SEQ ID NO: 94. A scaffold sequence can comprise the sequence of SEQ ID NO: 95.

In some aspects, the invention provides a nuclease that binds to a guide nucleic acid comprising a conserved scaffold sequence. For example, the nucleic acid-guided nucleases for use in the present disclosure can bind to a conserved pseudoknot region as shown in FIG. 13A. Specifically, the nucleic acid-guided nucleases for use in the present disclosure can bind to a guide nucleic acid comprising a conserved pseudoknot region as shown in FIG. 13A. Certain nucleic acid-guided nucleases for use in the present disclosure can bind to a pseudoknot region having at least 75%, 80%, 85%, 90%, 95%, or 100% sequence identity to the pseudoknot region of Scaffold-1 (SEQ ID NO: 172). Other nucleic acid-guided nucleases for use in the present disclosure can bind to a pseudoknot region having at least 75%, 80%, 85%, 90%, 95%, or 100% sequence identity to the pseudoknot region of Scaffold-3 (SEQ ID NO: 173). Still other nucleic acid-guided nucleases for use in the present disclosure can bind to a pseudoknot region having at least 75%, 80%, 85%, 90%, 95%, or 100% sequence identity to the pseudoknot region of Scaffold-4 (SEQ ID NO: 174). Other nucleic acid-guided nucleases for use in the present disclosure can bind to a pseudoknot region having at least 75%, 80%, 85%, 90%, 95%, or 100% sequence identity to the pseudoknot region of Scaffold-5 (SEQ ID NO: 175). Other nucleic acid-guided nucleases for use in the present disclosure can bind to a pseudoknot region having at least 75%, 80%, 85%, 90%, 95%, or 100% sequence identity to the pseudoknot region of Scaffold-6 (SEQ ID NO: 176). Still other nucleic acid-guided nucleases for use in the present disclosure can bind to a pseudoknot region having at least 75%, 80%, 85%, 90%, 95%, or 100% sequence identity to the pseudoknot region of Scaffold-7 (SEQ ID NO: 177). Other nucleic acid-guided nucleases for use in the present disclosure can bind to a pseudoknot region having at least 75%, 80%, 85%, 90%, 95%, or 100% sequence identity to the pseudoknot region of Scaffold-8 (SEQ ID NO: 178). Other nucleic acid-guided nucleases for use in the present disclosure can bind to a pseudoknot region having at least 75%, 80%, 85%, 90%, 95%, or 100% sequence identity to the pseudoknot region of Scaffold-10 (SEQ ID NO: 179). Still other nucleic acid-guided nucleases for use in the present disclosure can bind to a pseudoknot region having at least 75%, 80%, 85%, 90%, 95%, or 100% sequence identity to the pseudoknot region of Scaffold-11 (SEQ ID NO: 180). Certain nucleic acid-guided nucleases for use in the present disclosure can bind to a pseudoknot region having at least 75%, 80%, 85%, 90%, 95%, or 100% sequence identity to the pseudoknot region of Scaffold-12 (SEQ ID NO: 181).

A guide nucleic acid can comprise the sequence of any one of SEQ ID NO: 84-107. A guide nucleic acid can comprise the sequence of any one of SEQ ID NO: 84-103. A guide nucleic acid can comprise the sequence of any one of SEQ ID NO: 84-91 or 93-95. A guide nucleic acid can comprise the sequence of any one of SEQ ID NO: 88, 93, 94, or 95. A guide nucleic acid can comprise the sequence of SEQ ID NO: 88. A guide nucleic acid can comprise the sequence of SEQ ID NO: 93. A guide nucleic acid can comprise the sequence of SEQ ID NO: 94. A guide nucleic acid can comprise the sequence of SEQ ID NO: 95.

In aspects of the invention, the term “guide nucleic acid” refers to one or more polynucleotides comprising 1) a guide sequence capable of hybridizing to a target sequence and 2) a scaffold sequence capable of interacting with or complexing with a nucleic acid-guided nuclease as described herein. A guide nucleic acid may be provided as one or more nucleic acids. In specific embodiments, the guide sequence and the scaffold sequence are provided as a single polynucleotide.

A guide nucleic acid can be compatible with a nucleic acid-guided nuclease when the two elements can form a functional targetable nuclease complex capable of cleaving a target sequence. Often, a compatible scaffold sequence for a compatible guide nucleic acid can be found by scanning sequences adjacent to a native nucleic acid-guided nuclease locus. In other words, native nucleic acid-guided nucleases can be encoded on a genome within proximity to a corresponding compatible guide nucleic acid or scaffold sequence.

Nucleic acid-guided nucleases can be compatible with guide nucleic acids that are not found within the nuclease's endogenous host. Such orthogonal guide nucleic acids can be determined by empirical testing. Orthogonal guide nucleic acids can come from different bacterial species or be synthetic or otherwise engineered to be non-naturally occurring.

Orthogonal guide nucleic acids that are compatible with a common nucleic acid-guided nuclease can comprise one or more common features. Common features can include sequence outside a pseudoknot region. Common features can include a pseudoknot region. Common features can include a primary sequence or secondary structure.

A guide nucleic acid can be engineered to target a desired target sequence by altering the guide sequence such that the guide sequence is complementary to the target sequence, thereby allowing hybridization between the guide sequence and the target sequence. A guide nucleic acid with an engineered guide sequence can be referred to as an engineered guide nucleic acid. Engineered guide nucleic acids are often non-naturally occurring and are not found in nature.

Targetable Nuclease System

Disclosed herein are targetable nuclease systems. A targetable nuclease system can comprise a nucleic acid-guided nuclease and a compatible guide nucleic acid. A targetable nuclease system can comprise a nucleic acid-guided nuclease or a polynucleotide sequence encoding the nucleic acid-guided nuclease. A targetable nuclease system can comprise a guide nucleic acid or a polynucleotide sequence encoding the guide nucleic acid.

In general, a targetable nuclease system as disclosed herein is characterized by elements that promote the formation of a targetable nuclease complex at the site of a target sequence, wherein the targetable nuclease complex comprises a nucleic acid-guided nuclease and a guide nucleic acid.

A guide nucleic acid together with a nucleic acid-guided nuclease forms a targetable nuclease complex which is capable of binding to a target sequence within a target polynucleotide, as determined by the guide sequence of the guide nucleic acid.

In general, to generate a double-stranded break, a targetable nuclease complex binds to a target sequence as determined by the guide nucleic acid, and in most cases the nuclease must also recognize a protospacer adjacent motif (PAM) sequence adjacent to the target sequence.

A targetable nuclease complex can comprise a nucleic acid-guided nuclease of any one of SEQ ID NO: 1-20 and a compatible guide nucleic acid. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of any one of SEQ ID NO: 1-8 or 10-12 and a compatible guide nucleic acid. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of any one of SEQ ID NO: 1-8 or 10-11 and a compatible guide nucleic acid. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 2 and a compatible guide nucleic acid. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 7 and a compatible guide nucleic acid. In any of these cases, the guide nucleic acid can comprise a scaffold sequence compatible with the nucleic acid-guided nuclease. In any of these cases, the guide nucleic acid can further comprise a guide sequence. The guide sequence can be engineered to target any desired target sequence. The guide sequence can be engineered to be complementary to any desired target sequence. The guide sequence can be engineered to hybridize to any desired target sequence.

A targetable nuclease complex can comprise a nucleic acid-guided nuclease of any one of SEQ ID NO: 1-20 and a compatible guide nucleic acid comprising any one of SEQ ID NO: 84-107. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of any one of SEQ ID NO: 1-8 or 10-12 and a compatible guide nucleic acid comprising any one of SEQ ID NO: 84-95. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of any one of SEQ ID NO: 1-8 or 10-11 and a compatible guide nucleic acid comprising any one of SEQ ID NO: 84-91 or 93-95. In any of these cases, the guide nucleic acid can further comprise a guide sequence. The guide sequence can be engineered to target any desired target sequence. The guide sequence can be engineered to be complementary to any desired target sequence. The guide sequence can be engineered to hybridize to any desired target sequence.

A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 2 and a compatible guide nucleic acid. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 2 and a compatible guide nucleic acid comprising any one of SEQ ID NO: 88, 93, 94, or 95. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 2 and a compatible guide nucleic acid comprising SEQ ID NO: 88. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 2 and a compatible guide nucleic acid comprising SEQ ID NO: 93. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 2 and a compatible guide nucleic acid comprising SEQ ID NO: 94. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 2 and a compatible guide nucleic acid comprising SEQ ID NO: 95. In any of these cases, the guide nucleic acid can further comprise a guide sequence. The guide sequence can be engineered to target any desired target sequence. The guide sequence can be engineered to be complementary to any desired target sequence. The guide sequence can be engineered to hybridize to any desired target sequence.

A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 7 and a compatible guide nucleic acid. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 7 and a compatible guide nucleic acid comprising any one of SEQ ID NO: 88, 93, 94, or 95. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 7 and a compatible guide nucleic acid comprising SEQ ID NO: 88. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 7 and a compatible guide nucleic acid comprising SEQ ID NO: 93. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 7 and a compatible guide nucleic acid comprising SEQ ID NO: 94. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 7 and a compatible guide nucleic acid comprising SEQ ID NO: 95. In any of these cases, the guide nucleic acid can further comprise a guide sequence. The guide sequence can be engineered to target any desired target sequence. The guide sequence can be engineered to be complementary to any desired target sequence. The guide sequence can be engineered to hybridize to any desired target sequence.

A target sequence of a targetable nuclease complex can be any polynucleotide endogenous or exogenous to a prokaryotic or eukaryotic cell, or in vitro. For example, the target sequence can be a polynucleotide residing in the nucleus of the eukaryotic cell. A target sequence can be a sequence coding a gene product (e.g., a protein) or a non-coding sequence (e.g., a regulatory polynucleotide or junk DNA). Without wishing to be bound by theory, it is believed that the target sequence should be associated with a PAM; that is, a short sequence recognized by a targetable nuclease complex. The precise sequence and length requirements for a PAM differ depending on the nucleic acid-guided nuclease used, but PAMs are typically 2-5 base pair sequences adjacent to the target sequence. Examples of PAM sequences are given in the examples section below, and the skilled person will be able to identify further PAM sequences for use with a given nucleic acid-guided nuclease. Further, engineering of the PAM Interacting (PI) domain may allow programming of PAM specificity, improve target site recognition fidelity, and increase the versatility of a nucleic acid-guided nuclease genome engineering platform. Nucleic acid-guided nucleases may be engineered to alter their PAM specificity, for example as described in Kleinstiver B P et al. Engineered CRISPR-Cas9 nucleases with altered PAM specificities. Nature. 2015 Jul. 23; 523 (7561): 481-5. doi: 10.1038/nature14592.

A PAM site is a nucleotide sequence in proximity to a target sequence. In most cases, a nucleic acid-guided nuclease can only cleave a target sequence if an appropriate PAM is present. PAMs are nucleic acid-guided nuclease-specific and can be different between two different nucleic acid-guided nucleases. A PAM can be 5′ or 3′ of a target sequence. A PAM can be upstream or downstream of a target sequence. A PAM can be 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more nucleotides in length. Often, a PAM is between 2 and 6 nucleotides in length.
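By way of example only, candidate target sequences that lie immediately adjacent to a PAM can be located on one strand of a polynucleotide as sketched below. The function name, the parameter names, and the PAM patterns used in the accompanying usage note are hypothetical placeholders and do not represent the PAM of any particular nuclease disclosed herein. Only a subset of IUPAC ambiguity codes is handled, and only the strand as written is scanned.

```python
# Illustrative sketch only. Names and example PAM patterns are hypothetical.
import re

# Subset of IUPAC nucleotide codes, expressed as regular-expression fragments.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "N": "[ACGT]", "R": "[AG]", "Y": "[CT]",
}


def find_targets(seq: str, pam: str, target_len: int = 20, pam_side: str = "5prime"):
    """Return (pam_start, target) pairs for targets directly adjacent to a PAM.

    pam_side="5prime": the PAM sits 5' of the target (target follows the PAM).
    pam_side="3prime": the PAM sits 3' of the target (target precedes the PAM).
    """
    if pam_side not in ("5prime", "3prime"):
        raise ValueError("pam_side must be '5prime' or '3prime'")
    seq = seq.upper()
    pam_re = "".join(IUPAC[code] for code in pam.upper())
    hits = []
    # The lookahead lets overlapping PAM occurrences all be reported.
    for match in re.finditer(f"(?=({pam_re}))", seq):
        pam_start = match.start()
        if pam_side == "5prime":
            start = pam_start + len(pam)
        else:
            start = pam_start - target_len
        end = start + target_len
        # Keep only targets that fit entirely within the sequence.
        if start >= 0 and end <= len(seq):
            hits.append((pam_start, seq[start:end]))
    return hits
```

For instance, calling `find_targets("GGTTTACGTACGTACC", "TTN", target_len=5)` with the placeholder pattern `TTN` reports two overlapping PAM occurrences and the 5-nucleotide sequence that follows each.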

In some examples, a PAM can be provided on a separate oligonucleotide. In such cases, providing the PAM on an oligonucleotide allows cleavage of a target sequence that otherwise could not be cleaved because no adjacent PAM is present on the same polynucleotide as the target sequence.

Polynucleotide sequences encoding a component of a targetable nuclease system can comprise one or more vectors. In general, the term “vector” refers to a nucleic acid molecule capable of transporting another nucleic acid to which it has been linked. Vectors include, but are not limited to, nucleic acid molecules that are single-stranded, double-stranded, or partially double-stranded; nucleic acid molecules that comprise one or more free ends, no free ends (e.g. circular); nucleic acid molecules that comprise DNA, RNA, or both; and other varieties of polynucleotides known in the art. One type of vector is a “plasmid,” which refers to a circular double stranded DNA loop into which additional DNA segments can be inserted, such as by standard molecular cloning techniques. Another type of vector is a viral vector, wherein virally-derived DNA or RNA sequences are present in the vector for packaging into a virus (e.g. retroviruses, replication defective retroviruses, adenoviruses, replication defective adenoviruses, and adeno-associated viruses). Viral vectors also include polynucleotides carried by a virus for transfection into a host cell. Certain vectors are capable of autonomous replication in a host cell into which they are introduced (e.g. bacterial vectors having a bacterial origin of replication and episomal mammalian vectors). Other vectors (e.g., non-episomal mammalian vectors) are integrated into the genome of a host cell upon introduction into the host cell, and thereby are replicated along with the host genome. Moreover, certain vectors are capable of directing the expression of genes to which they are operatively-linked. Such vectors are referred to herein as “expression vectors.” Common expression vectors of utility in recombinant DNA techniques are often in the form of plasmids. Further discussion of vectors is provided herein.

Recombinant expression vectors can comprise a nucleic acid of the invention in a form suitable for expression of the nucleic acid in a host cell, which means that the recombinant expression vectors include one or more regulatory elements, which may be selected on the basis of the host cells to be used for expression, that are operatively-linked to the nucleic acid sequence to be expressed. Within a recombinant expression vector, “operably linked” is intended to mean that the nucleotide sequence of interest is linked to the regulatory element(s) in a manner that allows for expression of the nucleotide sequence (e.g. in an in vitro transcription/translation system or in a host cell when the vector is introduced into the host cell). With regard to recombination and cloning methods, mention is made of U.S. patent application Ser. No. 10/815,730, published Sep. 2, 2004 as US 2004-0171156 A1, the contents of which are herein incorporated by reference in their entirety.

In some embodiments, a regulatory element is operably linked to one or more elements of a targetable nuclease system so as to drive expression of the one or more components of the targetable nuclease system.

In some embodiments, a vector comprises a regulatory element operably linked to a polynucleotide sequence encoding a nucleic acid-guided nuclease. The polynucleotide sequence encoding the nucleic acid-guided nuclease can be codon optimized for expression in particular cells, such as prokaryotic or eukaryotic cells. Eukaryotic cells can be yeast, fungi, algae, plant, animal, or human cells. Eukaryotic cells may be those of or derived from a particular organism, such as a mammal, including but not limited to human, mouse, rat, rabbit, dog, or non-human mammal including non-human primate.

In general, codon optimization refers to a process of modifying a nucleic acid sequence for enhanced expression in the host cells of interest by replacing at least one codon (e.g. about or more than about 1, 2, 3, 4, 5, 10, 15, 20, 25, 50, or more codons) of the native sequence with codons that are more frequently or most frequently used in the genes of that host cell while maintaining the native amino acid sequence. Various species exhibit particular bias for certain codons of a particular amino acid. Codon bias (differences in codon usage between organisms) often correlates with the efficiency of translation of messenger RNA (mRNA), which is in turn believed to be dependent on, among other things, the properties of the codons being translated and the availability of particular transfer RNA (tRNA) molecules. The predominance of selected tRNAs in a cell is generally a reflection of the codons used most frequently in peptide synthesis. Accordingly, genes can be tailored for optimal gene expression in a given organism based on codon optimization. Codon usage tables are readily available, for example, at the “Codon Usage Database” available at www.kazusa.or.jp/codon/ (visited Jul. 9, 2002), and these tables can be adapted in a number of ways. See Nakamura, Y., et al. “Codon usage tabulated from the international DNA sequence databases: status for the year 2000” Nucl. Acids Res. 28:292 (2000). Computer algorithms for codon optimizing a particular sequence for expression in a particular host cell are also available, such as Gene Forge (Aptagen; Jacobus, Pa.). In some embodiments, one or more codons (e.g. 1, 2, 3, 4, 5, 10, 15, 20, 25, 50, or more, or all codons) in a sequence encoding an engineered nuclease correspond to the most frequently used codon for a particular amino acid.
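As a non-limiting illustration of the codon replacement described above, the following sketch substitutes each codon of a coding sequence with a preferred synonymous codon while leaving the encoded amino acid sequence unchanged. The function names are hypothetical, and the table of preferred codons is an arbitrary placeholder rather than measured codon usage for any host cell; in practice such a table would be derived from a codon usage table for the host of interest. The standard genetic code is assumed.

```python
# Illustrative sketch only. Names are hypothetical, and PREFERRED is a placeholder
# table, not real codon usage data for any organism.
from itertools import product

# Standard genetic code, built from the conventional TCAG ordering.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TO_AA = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}

# Placeholder "most frequently used" codons for three amino acids (assumed values).
PREFERRED = {"L": "CTG", "K": "AAG", "A": "GCC"}


def translate(cds: str) -> str:
    """Translate a coding sequence (length divisible by 3) with the standard code."""
    cds = cds.upper()
    return "".join(CODON_TO_AA[cds[i:i + 3]] for i in range(0, len(cds), 3))


def codon_optimize(cds: str, preferred: dict) -> str:
    """Replace each codon with the preferred codon for its amino acid, if one is listed."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    out = []
    for i in range(0, len(cds), 3):
        codon = cds[i:i + 3]
        amino_acid = CODON_TO_AA[codon]
        # Codons whose amino acid has no listed preference are kept unchanged.
        out.append(preferred.get(amino_acid, codon))
    return "".join(out)
```

With the placeholder table, `codon_optimize("ATGTTAAAAGCTTAA", PREFERRED)` changes three codons, and `translate` can be used to confirm that the amino acid sequence before and after replacement is the same.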

In some embodiments, a vector encodes a nucleic acid-guided nuclease comprising one or more nuclear localization sequences (NLSs), such as about or more than about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or more NLSs. In some embodiments, the engineered nuclease comprises about or more than about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or more NLSs at or near the amino-terminus, about or more than about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or more NLSs at or near the carboxy-terminus, or a combination of these (e.g. one or more NLS at the amino-terminus and one or more NLS at the carboxy-terminus). When more than one NLS is present, each may be selected independently of the others, such that a single NLS may be present in more than one copy and/or in combination with one or more other NLSs present in one or more copies. In a preferred embodiment of the invention, the engineered nuclease comprises at most 6 NLSs. In some embodiments, an NLS is considered near the N- or C-terminus when the nearest amino acid of the NLS is within about 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, 40, 50, or more amino acids along the polypeptide chain from the N- or C-terminus. Non-limiting examples of NLSs include an NLS sequence derived from: the NLS of the SV40 virus large T-antigen, having the amino acid sequence PKKKRKV (SEQ ID NO: 111); the NLS from nucleoplasmin (e.g. the nucleoplasmin bipartite NLS with the sequence KRPAATKKAGQAKKKK (SEQ ID NO: 112)); the c-myc NLS having the amino acid sequence PAAKRVKLD (SEQ ID NO: 113) or RQRRNELKRSP (SEQ ID NO: 114); the hRNPA1 M9 NLS having the sequence NQSSNFGPMKGGNFGGRSSGPYGGGGQYFAKPRNQGGY (SEQ ID NO: 115); the sequence RMRIZFKNKGKDTAELRRRRVEVSVELRKAKKDEQILKRRNV (SEQ ID NO: 116) of the IBB domain from importin-alpha; the sequences VSRKRPRP (SEQ ID NO: 117) and PPKKARED (SEQ ID NO: 118) of the myoma T protein; the sequence PQPKKKPL (SEQ ID NO: 119) of human p53; the sequence SALIKKKKKMAP (SEQ ID NO: 120) of mouse c-abl IV; the sequences DRLRR (SEQ ID NO: 121) and PKQKKRK (SEQ ID NO: 122) of the influenza virus NS1; the sequence RKLKKKIKKL (SEQ ID NO: 123) of the Hepatitis virus delta antigen; the sequence REKKKFLKRR (SEQ ID NO: 124) of the mouse Mx1 protein; the sequence KRKGDEVDGVDEVAKKKSKK (SEQ ID NO: 125) of the human poly(ADP-ribose) polymerase; and the sequence RKCLQAGMNLEARKTKK (SEQ ID NO: 126) of the steroid hormone receptors (human) glucocorticoid.
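By way of example only, whether an NLS is near the N- or C-terminus of a polypeptide, in the sense described above, can be checked as sketched below. The function name and the window parameter are hypothetical, the comparison is an exact substring match, and the polypeptides in the accompanying usage note are artificial placeholders rather than any nuclease disclosed herein. The SV40 large T-antigen NLS PKKKRKV (SEQ ID NO: 111) is used as the example NLS.

```python
# Illustrative sketch only. The function name and window parameter are hypothetical.

SV40_NLS = "PKKKRKV"  # SEQ ID NO: 111, as recited above


def nls_termini(protein: str, nls: str, window: int = 10) -> set:
    """Return the termini ('N', 'C') that have a copy of `nls` within `window` residues.

    Distance is counted from the terminus to the nearest amino acid of the NLS,
    so an NLS that starts at the first residue is 0 residues from the N-terminus.
    """
    protein, nls = protein.upper(), nls.upper()
    near = set()
    start = protein.find(nls)
    while start != -1:
        residues_before = start                              # residues between N-terminus and NLS
        residues_after = len(protein) - (start + len(nls))   # residues between NLS and C-terminus
        if residues_before <= window:
            near.add("N")
        if residues_after <= window:
            near.add("C")
        # Continue from the next position so every copy of the NLS is examined.
        start = protein.find(nls, start + 1)
    return near
```

For a placeholder polypeptide such as `"M" + SV40_NLS + "A" * 50`, the function reports the N-terminus only when `window=5`, because the NLS begins one residue from the N-terminus and ends 50 residues from the C-terminus.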

In general, the one or more NLSs are of sufficient strength to drive accumulation of the nucleic acid-guided nuclease in a detectable amount in the nucleus of a eukaryotic cell. In general, strength of nuclear localization activity may derive from the number of NLSs, the particular NLS(s) used, or a combination of these factors. Detection of accumulation in the nucleus may be performed by any suitable technique. For example, a detectable marker may be fused to the nucleic acid-guided nuclease, such that location within a cell may be visualized, such as in combination with a means for detecting the location of the nucleus (e.g. a stain specific for the nucleus such as DAPI). Cell nuclei may also be isolated from cells, the contents of which may then be analyzed by any suitable process for detecting protein, such as immunohistochemistry, Western blot, or enzyme activity assay. Accumulation in the nucleus may also be determined indirectly, such as by an assay for the effect of the nucleic acid-guided nuclease complex formation (e.g. assay for DNA cleavage or mutation at the target sequence, or assay for altered gene expression activity affected by targetable nuclease complex formation and/or nucleic acid-guided nuclease activity), as compared to a control not exposed to the nucleic acid-guided nuclease or targetable nuclease complex, or exposed to a nucleic acid-guided nuclease lacking the one or more NLSs.

A nucleic acid-guided nuclease and one or more guide nucleic acids can be delivered either as DNA or RNA. Delivery of a nucleic acid-guided nuclease and guide nucleic acid both as RNA (unmodified or containing base or backbone modifications) molecules can be used to reduce the amount of time that the nucleic acid-guided nuclease persists in the cell. This may reduce the level of off-target cleavage activity in the target cell. Since a nucleic acid-guided nuclease delivered as mRNA takes time to be translated into protein, it might be advantageous to deliver the guide nucleic acid several hours following the delivery of the nucleic acid-guided nuclease mRNA, to maximize the level of guide nucleic acid available for interaction with the nucleic acid-guided nuclease protein. In other cases, the nucleic acid-guided nuclease mRNA and guide nucleic acid are delivered concomitantly. In other examples, the guide nucleic acid is delivered sequentially, such as 0.5, 1, 2, 3, 4, or more hours after the nucleic acid-guided nuclease mRNA.

In situations where guide nucleic acid amount is limiting, it may be desirable to introduce a nucleic acid-guided nuclease as mRNA and guide nucleic acid in the form of a DNA expression cassette with a promoter driving the expression of the guide nucleic acid. This way the amount of guide nucleic acid available will be amplified via transcription.

Guide nucleic acid in the form of RNA or encoded on a DNA expression cassette can be introduced into a host cell comprising a nucleic acid-guided nuclease encoded on a vector or chromosome. The guide nucleic acid may be provided in the cassette as one or more polynucleotides, which may be contiguous or non-contiguous in the cassette. In specific embodiments, the guide nucleic acid is provided in the cassette as a single contiguous polynucleotide.

A variety of delivery systems can be used to introduce a nucleic acid-guided nuclease (DNA or RNA) and guide nucleic acid (DNA or RNA) into a host cell. These include the use of yeast systems, lipofection systems, microinjection systems, biolistic systems, virosomes, liposomes, immunoliposomes, polycations, lipid:nucleic acid conjugates, virions, artificial virions, viral vectors, electroporation, cell permeable peptides, nanoparticles, nanowires (Shalek et al., Nano Letters, 2012), and exosomes. Molecular Trojan horse liposomes (Pardridge et al., Cold Spring Harb Protoc; 2010; doi:10.1101/pdb.prot5407) may be used to deliver an engineered nuclease and guide nucleic acid across the blood brain barrier.

In some embodiments, an editing template is also provided. An editing template may be a component of a vector as described herein, contained in a separate vector, or provided as a separate polynucleotide, such as an oligonucleotide, linear polynucleotide, or synthetic polynucleotide. In some cases, an editing template is on the same polynucleotide as a guide nucleic acid. In some embodiments, an editing template is designed to serve as a template in homologous recombination, such as within or near a target sequence nicked or cleaved by a nucleic acid-guided nuclease as a part of a complex as disclosed herein. An editing template polynucleotide may be of any suitable length, such as about or more than about 10, 15, 20, 25, 50, 75, 100, 150, 200, 500, 1000, or more nucleotides in length. In some embodiments, the editing template polynucleotide is complementary to a portion of a polynucleotide comprising the target sequence. When optimally aligned, an editing template polynucleotide might overlap with one or more nucleotides of a target sequence (e.g. about or more than about 1, 5, 10, 15, 20, 25, 30, 35, 40, or more nucleotides). In some embodiments, when an editing template sequence and a polynucleotide comprising a target sequence are optimally aligned, the nearest nucleotide of the template polynucleotide is within about 1, 5, 10, 15, 20, 25, 50, 75, 100, 200, 300, 400, 500, 1000, 5000, 10000, or more nucleotides from the target sequence.

In many examples, an editing template comprises at least one mutation compared to the target sequence. An editing template can comprise an insertion, deletion, modification, or any combination thereof compared to the target sequence. Examples of some editing templates are described in more detail in a later section.

In some aspects, the invention provides methods comprising delivering one or more polynucleotides, such as one or more vectors or linear polynucleotides as described herein, one or more transcripts thereof, and/or one or more proteins transcribed therefrom, to a host cell. In some aspects, the invention further provides cells produced by such methods, and organisms comprising or produced from such cells. In some embodiments, an engineered nuclease in combination with (and optionally complexed with) a guide nucleic acid is delivered to a cell.

Conventional viral and non-viral based gene transfer methods can be used to introduce nucleic acids in cells, such as prokaryotic cells, eukaryotic cells, mammalian cells, or target tissues. Such methods can be used to administer nucleic acids encoding components of an engineered nucleic acid-guided nuclease system to cells in culture, or in a host organism. Non-viral vector delivery systems include DNA plasmids, RNA (e.g. a transcript of a vector described herein), naked nucleic acid, and nucleic acid complexed with a delivery vehicle, such as a liposome. Viral vector delivery systems include DNA and RNA viruses, which have either episomal or integrated genomes after delivery to the cell. For a review of gene therapy procedures, see Anderson, Science 256:808-813 (1992); Nabel & Felgner, TIBTECH 11:211-217 (1993); Mitani & Caskey, TIBTECH 11:162-166 (1993); Dillon, TIBTECH 11:167-175 (1993); Miller, Nature 357:455-460 (1992); Van Brunt, Biotechnology 6(10):1149-1154 (1988); Vigne, Restorative Neurology and Neuroscience 8:35-36 (1995); Kremer & Perricaudet, British Medical Bulletin 51(1):31-44 (1995); Haddada et al., in Current Topics in Microbiology and Immunology, Doerfler and Bohm (eds) (1995); and Yu et al., Gene Therapy 1:13-26 (1994).

Methods of non-viral delivery of nucleic acids include lipofection, microinjection, biolistics, virosomes, liposomes, immunoliposomes, polycation or lipid:nucleic acid conjugates, naked DNA, artificial virions, and agent-enhanced uptake of DNA. Lipofection is described in, e.g., U.S. Pat. Nos. 5,049,386; 4,946,787; and 4,897,355, and lipofection reagents are sold commercially (e.g., Transfectam™ and Lipofectin™). Cationic and neutral lipids that are suitable for efficient receptor-recognition lipofection of polynucleotides include those of Felgner, WO 91/17424; WO 91/16024. Delivery can be to cells (e.g. in vitro or ex vivo administration) or target tissues (e.g. in vivo administration).

The preparation of lipid:nucleic acid complexes, including targeted liposomes such as immunolipid complexes, is well known to one of skill in the art (see, e.g., Crystal, Science 270:404-410 (1995); Blaese et al., Cancer Gene Ther. 2:291-297 (1995); Behr et al., Bioconjugate Chem. 5:382-389 (1994); Remy et al., Bioconjugate Chem. 5:647-654 (1994); Gao et al., Gene Therapy 2:710-722 (1995); Ahmad et al., Cancer Res. 52:4817-4820 (1992); U.S. Pat. Nos. 4,186,183, 4,217,344, 4,235,871, 4,261,975, 4,485,054, 4,501,728, 4,774,085, 4,837,028, and 4,946,787).

The use of RNA or DNA viral based systems for the delivery of nucleic acids takes advantage of highly evolved processes for targeting a virus to specific cells in culture or in the host and trafficking the viral payload to the nucleus or host cell genome. Viral vectors can be administered directly to cells in culture or to patients (in vivo), or they can be used to treat cells in vitro, and the modified cells may optionally be administered to patients (ex vivo). Conventional viral based systems could include retroviral, lentivirus, adenoviral, adeno-associated and herpes simplex virus vectors for gene transfer. Integration in the host genome is possible with the retrovirus, lentivirus, and adeno-associated virus gene transfer methods, often resulting in long term expression of the inserted transgene. Additionally, high transduction efficiencies have been observed in many different cell types and target tissues.

The tropism of a retrovirus can be altered by incorporating foreign envelope proteins, expanding the potential population of target cells. Lentiviral vectors are retroviral vectors that are able to transduce or infect non-dividing cells and typically produce high viral titers. Selection of a retroviral gene transfer system would therefore depend on the target tissue. Retroviral vectors are comprised of cis-acting long terminal repeats (LTRs) with packaging capacity for up to 6-10 kb of foreign sequence. The minimum cis-acting LTRs are sufficient for replication and packaging of the vectors, which are then used to integrate the therapeutic gene into the target cell to provide permanent transgene expression. Widely used retroviral vectors include those based upon murine leukemia virus (MuLV), gibbon ape leukemia virus (GaLV), simian immunodeficiency virus (SIV), human immunodeficiency virus (HIV), and combinations thereof (see, e.g., Buchscher et al., J. Virol. 66:2731-2739 (1992); Johann et al., J. Virol. 66:1635-1640 (1992); Sommerfelt et al., Virol. 176:58-59 (1990); Wilson et al., J. Virol. 63:2374-2378 (1989); Miller et al., J. Virol. 65:2220-2224 (1991); PCT/US94/05700).

In applications where transient expression is preferred, adenoviral based systems may be used. Adenoviral based vectors are capable of very high transduction efficiency in many cell types and do not require cell division. With such vectors, high titer and levels of expression have been obtained. This vector can be produced in large quantities in a relatively simple system.

Adeno-associated virus (“AAV”) vectors may also be used to transduce cells with target nucleic acids, e.g., in the in vitro production of nucleic acids and peptides, and for in vivo and ex vivo gene therapy procedures (see, e.g., West et al., Virology 160:38-47 (1987); U.S. Pat. No. 4,797,368; WO 93/24641; Kotin, Human Gene Therapy 5:793-801 (1994); Muzyczka, J. Clin. Invest. 94:1351 (1994)). Construction of recombinant AAV vectors is described in a number of publications, including U.S. Pat. No. 5,173,414; Tratschin et al., Mol. Cell. Biol. 5:3251-3260 (1985); Tratschin, et al., Mol. Cell. Biol. 4:2072-2081 (1984); Hermonat & Muzyczka, PNAS 81:6466-6470 (1984); and Samulski et al., J. Virol. 63:3822-3828 (1989).

In some embodiments, a host cell is transiently or non-transiently transfected with one or more vectors, linear polynucleotides, polypeptides, nucleic acid-protein complexes, or any combination thereof as described herein. In some embodiments, a cell is transfected in vitro, in culture, or ex vivo. In some embodiments, a cell is transfected as it naturally occurs in a subject. In some embodiments, a cell that is transfected is taken from a subject. In some embodiments, the cell is derived from cells taken from a subject, such as a cell line.

In some embodiments, a cell transfected with one or more vectors, linear polynucleotides, polypeptides, nucleic acid-protein complexes, or any combination thereof as described herein is used to establish a new cell line comprising one or more transfection-derived sequences. In some embodiments, a cell transiently transfected with the components of an engineered nucleic acid-guided nuclease system as described herein (such as by transient transfection of one or more vectors, or transfection with RNA), and modified through the activity of an engineered nuclease complex, is used to establish a new cell line comprising cells containing the modification but lacking any other exogenous sequence.

In some embodiments, one or more vectors described herein are used to produce a non-human transgenic cell, organism, animal, or plant. In some embodiments, the transgenic animal is a mammal, such as a mouse, rat, or rabbit. Methods for producing transgenic cells, organisms, plants, and animals are known in the art, and generally begin with a method of cell transformation or transfection, such as described herein.

Methods of Use

In the context of formation of an engineered nuclease complex, “target sequence” refers to a sequence to which a guide sequence is designed to have complementarity, where hybridization between a target sequence and a guide sequence promotes the formation of an engineered nuclease complex. A target sequence may comprise any polynucleotide, such as DNA, RNA, or a DNA-RNA hybrid. A target sequence can be located in the nucleus or cytoplasm of a cell. A target sequence can be located in vitro or in a cell-free environment.

Typically, formation of an engineered nuclease complex comprising a guide nucleic acid hybridized to a target sequence and complexed with one or more engineered nucleases as disclosed herein results in cleavage of one or both strands in or near (e.g. within 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 20, 50, or more base pairs from) the target sequence. Cleavage can occur within a target sequence, 5′ of the target sequence, upstream of a target sequence, 3′ of the target sequence, or downstream of a target sequence.

In some embodiments, one or more vectors driving expression of one or more components of a targetable nuclease system are introduced into a host cell or in vitro such that a targetable nuclease complex forms at one or more target sites. For example, a nucleic acid-guided nuclease and a guide nucleic acid could each be operably linked to separate regulatory elements on separate vectors. Alternatively, two or more of the elements, expressed from the same or different regulatory elements, may be combined in a single vector, with one or more additional vectors providing any components of the targetable nuclease system not included in the first vector. Targetable nuclease system elements that are combined in a single vector may be arranged in any suitable orientation, such as one element located 5′ with respect to (“upstream” of) or 3′ with respect to (“downstream” of) a second element. The coding sequence of one element may be located on the same or opposite strand of the coding sequence of a second element, and oriented in the same or opposite direction. In some embodiments, a single promoter drives expression of a transcript encoding a nucleic acid-guided nuclease and one or more guide nucleic acids. In some embodiments, a nucleic acid-guided nuclease and one or more guide nucleic acids are operably linked to and expressed from the same promoter. In other embodiments, one or more guide nucleic acids or polynucleotides encoding the one or more guide nucleic acids are introduced into a cell or in vitro environment already comprising a nucleic acid-guided nuclease or polynucleotide sequence encoding the nucleic acid-guided nuclease.

When multiple different guide sequences are used, a single expression construct may be used to target nuclease activity to multiple different, corresponding target sequences within a cell or in vitro. For example, a single vector may comprise about or more than about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, or more guide sequences. In some embodiments, about or more than about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or more such guide-sequence-containing vectors may be provided, and optionally delivered to a cell or in vitro.

Methods and compositions disclosed herein may comprise more than one guide nucleic acid, wherein each guide nucleic acid has a different guide sequence, thereby targeting a different target sequence. In such cases, multiple guide nucleic acids can be used in multiplexing, wherein multiple targets are targeted simultaneously. Additionally or alternatively, the multiple guide nucleic acids are introduced into a population of cells, such that each cell in the population receives a different or random guide nucleic acid, thereby targeting multiple different target sequences across a population of cells. In such cases, the collection of subsequently altered cells can be referred to as a library.

Methods and compositions disclosed herein may comprise multiple different nucleic acid-guided nucleases, each with one or more different corresponding guide nucleic acids, thereby allowing targeting of different target sequences by different nucleic acid-guided nucleases. In some such cases, each nucleic acid-guided nuclease can correspond to a distinct plurality of guide nucleic acids, allowing two or more non-overlapping, partially overlapping, or completely overlapping multiplexing events.

In some embodiments, the nucleic acid-guided nuclease has DNA cleavage activity or RNA cleavage activity. In some embodiments, the nucleic acid-guided nuclease directs cleavage of one or both strands at the location of a target sequence, such as within the target sequence and/or within the complement of the target sequence. In some embodiments, the nucleic acid-guided nuclease directs cleavage of one or both strands within about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 50, 100, 200, 500, or more base pairs from the first or last nucleotide of a target sequence.

In some embodiments, a nucleic acid-guided nuclease may form a component of an inducible system. The inducible nature of the system would allow for spatiotemporal control of gene editing or gene expression using a form of energy. The form of energy may include but is not limited to electromagnetic radiation, sound energy, chemical energy, light energy, temperature, and thermal energy. Examples of inducible systems include tetracycline inducible promoters (Tet-On or Tet-Off), small molecule two-hybrid transcription activation systems (FKBP, ABA, etc.), or light inducible systems (Phytochrome, LOV domains, or cryptochrome). In one embodiment, the nucleic acid-guided nuclease may be a part of a Light Inducible Transcriptional Effector (LITE) to direct changes in transcriptional activity in a sequence-specific manner. The components of a light inducible system may include a nucleic acid-guided nuclease, a light-responsive cytochrome heterodimer (e.g. from Arabidopsis thaliana), and a transcriptional activation/repression domain. Further examples of inducible DNA binding proteins and methods for their use are provided in U.S. 61/736,465 and U.S. 61/721,283, which are hereby incorporated by reference in their entirety. An inducible system can be temperature inducible such that the system is turned on or off by increasing or decreasing the temperature. In some temperature inducible systems, increasing the temperature turns the system on. In some temperature inducible systems, increasing the temperature turns the system off.

In some aspects, the invention provides for methods of modifying a target sequence in vitro, or in a prokaryotic or eukaryotic cell, which may be in vivo, ex vivo, or in vitro. In some embodiments, the method comprises sampling a cell or population of cells such as prokaryotic cells, or those from a human or non-human animal or plant (including micro-algae), and modifying the cell or cells. Culturing may occur at any stage in vitro or ex vivo. The cell or cells may even be re-introduced into the host, such as a non-human animal or plant (including micro-algae). For re-introduced cells it is particularly preferred that the cells are stem cells.

In some embodiments, the method comprises allowing a targetable nuclease complex to bind to the target sequence to effect cleavage of said target sequence, thereby modifying the target sequence, wherein the targetable nuclease complex comprises a nucleic acid-guided nuclease complexed with a guide nucleic acid, and wherein the guide sequence of the guide nucleic acid is hybridized to a target sequence within a target polynucleotide.

In some aspects, the invention provides a method of modifying expression of a target polynucleotide in vitro or in a prokaryotic or eukaryotic cell. In some embodiments, the method comprises allowing a targetable nuclease complex to bind to a target sequence within the target polynucleotide such that said binding results in increased or decreased expression of said target polynucleotide; wherein the targetable nuclease complex comprises a nucleic acid-guided nuclease complexed with a guide nucleic acid, and wherein the guide sequence of the guide nucleic acid is hybridized to a target sequence within said target polynucleotide. Similar considerations apply as above for methods of modifying a target polynucleotide. In fact, these sampling, culturing and re-introduction options apply across the aspects of the present invention.

In some aspects, the invention provides kits containing any one or more of the elements disclosed in the above methods and compositions. Elements may be provided individually or in combinations, and may be provided in any suitable container, such as a vial, a bottle, or a tube. In some embodiments, the kit includes instructions in one or more languages, for example in more than one language.

In some embodiments, a kit comprises one or more reagents for use in a process utilizing one or more of the elements described herein. Reagents may be provided in any suitable container. For example, a kit may provide one or more reaction or storage buffers. Reagents may be provided in a form that is usable in a particular assay, or in a form that requires addition of one or more other components before use (e.g. in concentrate or lyophilized form). A buffer can be any buffer, including but not limited to a sodium carbonate buffer, a sodium bicarbonate buffer, a borate buffer, a Tris buffer, a MOPS buffer, a HEPES buffer, and combinations thereof. In some embodiments, the buffer is alkaline. In some embodiments, the buffer has a pH from about 7 to about 10. In some embodiments, the kit comprises one or more oligonucleotides corresponding to a guide sequence for insertion into a vector so as to operably link the guide sequence and a regulatory element. In some embodiments, the kit comprises an editing template.

In some aspects, the invention provides methods for using one or more elements of an engineered targetable nuclease system. A targetable nuclease complex of the disclosure provides an effective means for modifying a target sequence within a target polynucleotide. A targetable nuclease complex of the disclosure has a wide variety of utility including modifying (e.g., deleting, inserting, translocating, inactivating, activating) a target sequence in a multiplicity of cell types. As such, a targetable nuclease complex of the invention has a broad spectrum of applications in, e.g., biochemical pathway optimization, genome-wide studies, genome engineering, gene therapy, drug screening, disease diagnosis, and prognosis. An exemplary targetable nuclease complex comprises a nucleic acid-guided nuclease as disclosed herein complexed with a guide nucleic acid, wherein the guide sequence of the guide nucleic acid can hybridize to a target sequence within the target polynucleotide. A guide nucleic acid can comprise a guide sequence linked to a scaffold sequence. A scaffold sequence can comprise one or more sequence regions with a degree of complementarity such that together they form a secondary structure. In some cases, the one or more sequence regions are comprised or encoded on the same polynucleotide. In some cases, the one or more sequence regions are comprised or encoded on separate polynucleotides.

Provided herein are methods of cleaving a target polynucleotide. The method comprises cleaving a target polynucleotide using a targetable nuclease complex that binds to a target sequence within a target polynucleotide and effects cleavage of said target polynucleotide. Typically, the targetable nuclease complex of the invention, when introduced into a cell, creates a break (e.g., a single or a double strand break) in the target sequence. For example, the method can be used to cleave a target gene in a cell, or to replace a wildtype sequence with a modified sequence.

The break created by the targetable nuclease complex can be repaired by a repair process such as the error prone non-homologous end joining (NHEJ) pathway, the high fidelity homology-directed repair (HDR) pathway, or by recombination pathways. During these repair processes, an editing template can be introduced into the genome sequence. In some methods, the HDR or recombination process is used to modify a target sequence. For example, an editing template comprising a sequence to be integrated flanked by an upstream sequence and a downstream sequence is introduced into a cell. The upstream and downstream sequences share sequence similarity with either side of the site of integration in the chromosome, target vector, or target polynucleotide.
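The layout of such an editing template (upstream sequence, sequence to be integrated, downstream sequence) can be sketched in a few lines of Python. This is only an illustration: the function name `build_editing_template`, its parameters, and the toy sequences are hypothetical and are not taken from the disclosure, and the flanking sequences are simply copied from either side of the integration site.

```python
def build_editing_template(target: str, site: int, insert: str, arm_len: int = 20) -> str:
    """Return upstream arm + insert + downstream arm for a given integration site.

    `target` is the target polynucleotide as a string, `site` is the 0-based
    position at which `insert` is to be integrated, and `arm_len` is the
    length of each flanking sequence (hypothetical parameters).
    """
    if not 0 <= site <= len(target):
        raise ValueError("integration site lies outside the target sequence")
    # Flanking sequences are copied from either side of the integration site,
    # so each one is identical to the corresponding region of the target.
    upstream = target[max(0, site - arm_len):site]
    downstream = target[site:site + arm_len]
    return upstream + insert + downstream


# Toy example: 4-nt arms around position 8 of a 16-nt target.
template = build_editing_template("AAAACCCCGGGGTTTT", site=8, insert="NNN", arm_len=4)
print(template)
```

In this sketch the arms are exact copies of the target; a practical design could also carry deliberate changes, such as a mutated PAM site, which are not modeled here.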

An editing template can be DNA or RNA, e.g., a DNA plasmid, a bacterial artificial chromosome (BAC), a yeast artificial chromosome (YAC), a viral vector, a linear piece of DNA, a PCR fragment, oligonucleotide, synthetic polynucleotide, a naked nucleic acid, or a nucleic acid complexed with a delivery vehicle such as a liposome or poloxamer.

An editing template polynucleotide can comprise a sequence to be integrated (e.g., a mutated gene). A sequence for integration may be a sequence endogenous or exogenous to the cell. Examples of a sequence to be integrated include polynucleotides encoding a protein or a non-coding RNA (e.g., a microRNA). Thus, the sequence for integration may be operably linked to an appropriate control sequence or sequences. Alternatively, the sequence to be integrated may provide a regulatory function. The sequence to be integrated may be a mutated form or variant of an endogenous wildtype sequence. Alternatively, the sequence to be integrated may be a wildtype version of an endogenous mutated sequence. Additionally or alternatively, the sequence to be integrated may be a variant or mutated form of an endogenous mutated or variant sequence.

Upstream and downstream sequences in an editing template polynucleotide can be selected to promote recombination between the target polynucleotide of interest and the editing template polynucleotide. The upstream sequence can be a nucleic acid sequence having sequence similarity with the sequence upstream of the targeted site for integration. Similarly, the downstream sequence can be a nucleic acid sequence having similarity with the sequence downstream of the targeted site of integration. The upstream and downstream sequences in an editing template can have 75%, 80%, 85%, 90%, 95%, or 100% sequence identity with the targeted polynucleotide. Preferably, the upstream and downstream sequences in the editing template polynucleotide have about 95%, 96%, 97%, 98%, 99%, or 100% sequence identity with the targeted polynucleotide. In some methods, the upstream and downstream sequences in the editing template polynucleotide have about 99% or 100% sequence identity with the targeted polynucleotide.
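As an illustration of how an identity figure of this kind might be checked, the following Python sketch computes percent identity between a flanking sequence and the corresponding region of the targeted polynucleotide. It assumes the two sequences are already aligned, ungapped, and of equal length, which is a simplification; the function names and the 95% default threshold are hypothetical choices for the example.

```python
def percent_identity(arm: str, target_region: str) -> float:
    """Percent of positions at which two equal-length, pre-aligned sequences match."""
    if not arm or len(arm) != len(target_region):
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(a == b for a, b in zip(arm.upper(), target_region.upper()))
    return 100.0 * matches / len(arm)


def meets_identity(arm: str, target_region: str, threshold: float = 95.0) -> bool:
    """True if the arm reaches the chosen identity threshold (hypothetical default)."""
    return percent_identity(arm, target_region) >= threshold


# One mismatch in ten positions.
print(percent_identity("ACGTACGTAC", "ACGTACGTAA"))
print(meets_identity("ACGTACGTAC", "ACGTACGTAA", threshold=90.0))
```

Sequences that differ by insertions or deletions would need a proper alignment step before identity is computed, which this sketch does not attempt.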

An upstream or downstream sequence may comprise from about 20 bp to about 2500 bp, for example, about 50, 100, 200, 300, 400, 500, 600, 700, 800, 900, 1000, 1100, 1200, 1300, 1400, 1500, 1600, 1700, 1800, 1900, 2000, 2100, 2200, 2300, 2400, or 2500 bp. In some methods, the exemplary upstream or downstream sequence has about 15 bp to about 50 bp, about 30 bp to about 100 bp, about 200 bp to about 2000 bp, about 600 bp to about 1000 bp, or more particularly about 700 bp to about 1000 bp.

In some methods, the editing template polynucleotide may further comprise a marker. Such a marker may make it easy to screen for targeted integrations. Examples of suitable markers include restriction sites, fluorescent proteins, or selectable markers. The exogenous polynucleotide template of the invention can be constructed using recombinant techniques (see, for example, Green and Sambrook et al., 2014 and Ausubel et al., 2017).

In an exemplary method for modifying a target polynucleotide by integrating an editing template polynucleotide, a double stranded break is introduced into the genome sequence by an engineered nuclease complex, and the break can be repaired via homologous recombination using an editing template such that the template is integrated into the target polynucleotide. The presence of a double-stranded break can increase the efficiency of integration of the editing template.

Disclosed herein are methods for modifying expression of a polynucleotide in a cell. Some methods comprise increasing or decreasing expression of a target polynucleotide by using a targetable nuclease complex that binds to the target polynucleotide.

In some methods, a target polynucleotide can be inactivated to effect the modification of the expression in a cell. For example, upon the binding of a targetable nuclease complex to a target sequence in a cell, the target polynucleotide is inactivated such that the sequence is not transcribed, the coded protein is not produced, or the sequence does not function as the wild-type sequence does. For example, a protein or microRNA coding sequence may be inactivated such that the protein is not produced.

In some methods, a control sequence can be inactivated such that it no longer functions as a regulatory sequence. As used herein, “regulatory sequence” can refer to any nucleic acid sequence that affects the transcription, translation, or accessibility of a nucleic acid sequence. Examples of regulatory sequences include a promoter, a transcription terminator, and an enhancer.

An inactivated target sequence may include a deletion mutation (i.e., deletion of one or more nucleotides), an insertion mutation (i.e., insertion of one or more nucleotides), or a nonsense mutation (i.e., substitution of a single nucleotide for another nucleotide such that a stop codon is introduced). In some methods, the inactivation of a target sequence results in “knockout” of the target sequence.
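The three mutation types named above can be told apart with a short Python sketch. It is a deliberately simplified illustration: it assumes both sequences are DNA coding sequences read in frame from their first nucleotide, uses the standard stop codons TAA, TAG, and TGA, and distinguishes deletions from insertions by length alone. The function name `classify_mutation` is hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard DNA stop codons


def classify_mutation(wildtype: str, mutant: str) -> str:
    """Classify a mutant coding sequence as 'deletion', 'insertion', 'nonsense', or 'other'."""
    wt, mut = wildtype.upper(), mutant.upper()
    if len(mut) < len(wt):
        return "deletion"
    if len(mut) > len(wt):
        return "insertion"
    # Same length: look for a single substitution that creates a stop codon.
    diffs = [i for i in range(len(wt)) if wt[i] != mut[i]]
    if len(diffs) == 1:
        start = diffs[0] - diffs[0] % 3  # first base of the affected codon
        if mut[start:start + 3] in STOP_CODONS and wt[start:start + 3] not in STOP_CODONS:
            return "nonsense"
    return "other"


# TGG (Trp) -> TGA (stop) by a single G-to-A substitution.
print(classify_mutation("ATGTGGAAA", "ATGTGAAAA"))
```

A substitution that changes an amino acid without introducing a stop codon falls into "other" here, since the paragraph above does not list it as an inactivating mutation.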

An altered expression of one or more target polynucleotides associated with a signaling biochemical pathway can be determined by assaying for a difference in the mRNA levels of the corresponding genes between the test model cell and a control cell, when they are contacted with a candidate agent. Alternatively, the differential expression of the sequences associated with a signaling biochemical pathway is determined by detecting a difference in the level of the encoded polypeptide or gene product.

To assay for an agent-induced alteration in the level of mRNA transcripts or corresponding polynucleotides, nucleic acid contained in a sample is first extracted according to standard methods in the art. For instance, mRNA can be isolated using various lytic enzymes or chemical solutions according to the procedures set forth in Green and Sambrook (2014), or extracted by nucleic-acid-binding resins following the accompanying instructions provided by the manufacturers. The mRNA contained in the extracted nucleic acid sample is then detected by amplification procedures or conventional hybridization assays (e.g. Northern blot analysis) according to methods widely known in the art or based on the methods exemplified herein.

For purposes of this invention, amplification means any method employing a primer and a polymerase capable of replicating a target sequence with reasonable fidelity. Amplification may be carried out by natural or recombinant DNA polymerases such as TaqGold™, T7 DNA polymerase, Klenow fragment of E. coli DNA polymerase, and reverse transcriptase. A preferred amplification method is PCR. In particular, the isolated RNA can be subjected to a reverse transcription assay that is coupled with a quantitative polymerase chain reaction (RT-PCR) in order to quantify the expression level of a sequence associated with a signaling biochemical pathway.
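The disclosure does not specify how quantitative RT-PCR readings are converted into an expression level. One widely used approach, shown here only as an assumed example, is the comparative threshold-cycle (2^-ΔΔCt) calculation, which normalizes the target gene to a reference gene and compares a test sample with a control sample. The sketch assumes roughly 100% amplification efficiency for both genes; the function name and the Ct values are hypothetical.

```python
def relative_expression(ct_target_test: float, ct_ref_test: float,
                        ct_target_ctrl: float, ct_ref_ctrl: float) -> float:
    """Fold change of a target gene in a test sample relative to a control sample.

    Uses the comparative Ct calculation 2 ** -(delta_ct_test - delta_ct_ctrl),
    assuming the amount of product doubles every cycle for both genes.
    """
    delta_ct_test = ct_target_test - ct_ref_test   # normalize test sample to reference gene
    delta_ct_ctrl = ct_target_ctrl - ct_ref_ctrl   # normalize control sample to reference gene
    delta_delta_ct = delta_ct_test - delta_ct_ctrl
    return 2.0 ** (-delta_delta_ct)


# Target crosses threshold two cycles earlier in the test sample; reference gene unchanged.
fold = relative_expression(ct_target_test=22.0, ct_ref_test=18.0,
                           ct_target_ctrl=24.0, ct_ref_ctrl=18.0)
print(fold)
```

A fold change above 1 indicates higher expression in the test sample than in the control, and a value below 1 indicates lower expression.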

Detection of the gene expression level can be conducted in real time in an amplification assay. In one aspect, the amplified products can be directly visualized with fluorescent DNA-binding agents including but not limited to DNA intercalators and DNA groove binders. Because the amount of the intercalators incorporated into the double-stranded DNA molecules is typically proportional to the amount of the amplified DNA products, one can conveniently determine the amount of the amplified products by quantifying the fluorescence of the intercalated dye using conventional optical systems in the art. DNA-binding dyes suitable for this application include SYBR green, SYBR blue, DAPI, propidium iodide, Hoechst, SYBR gold, ethidium bromide, acridines, proflavine, acridine orange, acriflavine, fluorcoumanin, ellipticine, daunomycin, chloroquine, distamycin D, chromomycin, homidium, mithramycin, ruthenium polypyridyls, anthramycin, and the like.

In another aspect, other fluorescent labels such as sequence specific probes can be employed in the amplification reaction to facilitate the detection and quantification of the amplified products. Probe-based quantitative amplification relies on the sequence-specific detection of a desired amplified product. It utilizes fluorescent, target-specific probes (e.g., TaqMan™ probes) resulting in increased specificity and sensitivity. Methods for performing probe-based quantitative amplification are well established in the art and are taught in U.S. Pat. No. 5,210,015.

In yet another aspect, conventional hybridization assays using hybridization probes that share sequence homology with sequences associated with a signaling biochemical pathway can be performed. Typically, probes are allowed to form stable complexes with the sequences associated with a signaling biochemical pathway contained within the biological sample derived from the test subject in a hybridization reaction. It will be appreciated by one of skill in the art that where antisense is used as the probe nucleic acid, the target polynucleotides provided in the sample are chosen to be complementary to sequences of the antisense nucleic acids. Conversely, where the nucleotide probe is a sense nucleic acid, the target polynucleotide is selected to be complementary to sequences of the sense nucleic acid.

Hybridization can be performed under conditions of various stringency, for instance as described herein. Suitable hybridization conditions for the practice of the present invention are such that the recognition interaction between the probe and sequences associated with a signaling biochemical pathway is both sufficiently specific and sufficiently stable. Conditions that increase the stringency of a hybridization reaction are widely known and published in the art. See, for example, Green and Sambrook, et al. (2014); Nonradioactive in Situ Hybridization Application Manual, Boehringer Mannheim, second edition. The hybridization assay can be performed using probes immobilized on any solid support, including but not limited to nitrocellulose, glass, silicon, and a variety of gene arrays. A preferred hybridization assay is conducted on high-density gene chips as described in U.S. Pat. No. 5,445,934.

For convenient detection of the probe-target complexes formed during the hybridization assay, the nucleotide probes are conjugated to a detectable label. Detectable labels suitable for use in the present invention include any composition detectable by photochemical, biochemical, spectroscopic, immunochemical, electrical, optical or chemical means. A wide variety of appropriate detectable labels are known in the art, which include fluorescent or chemiluminescent labels, radioactive isotope labels, enzymatic or other ligands. In preferred embodiments, one will likely desire to employ a fluorescent label or an enzyme tag, such as digoxigenin, β-galactosidase, urease, alkaline phosphatase or peroxidase, avidin/biotin complex.

Detection methods used to detect or quantify the hybridization intensity will typically depend upon the label selected above. For example, radiolabels may be detected using photographic film or a phosphoimager. Fluorescent markers may be detected and quantified using a photodetector to detect emitted light. Enzymatic labels are typically detected by providing the enzyme with a substrate and measuring the reaction product produced by the action of the enzyme on the substrate; and finally colorimetric labels are detected by simply visualizing the colored label.

An agent-induced change in expression of sequences associated with a signaling biochemical pathway can also be determined by examining the corresponding gene products. Determining the protein level typically involves (a) contacting the protein contained in a biological sample with an agent that specifically binds to a protein associated with a signaling biochemical pathway; and (b) identifying any agent:protein complex so formed. In one aspect of this embodiment, the agent that specifically binds a protein associated with a signaling biochemical pathway is an antibody, preferably a monoclonal antibody.

The reaction can be performed by contacting the agent with a sample of the proteins associated with a signaling biochemical pathway derived from the test samples under conditions that will allow a complex to form between the agent and the proteins associated with a signaling biochemical pathway. The formation of the complex can be detected directly or indirectly according to standard procedures in the art. In the direct detection method, the agents are supplied with a detectable label and unreacted agents may be removed from the complex; the amount of remaining label thereby indicating the amount of complex formed. For such method, it is preferable to select labels that remain attached to the agents even during stringent washing conditions. It is preferable that the label does not interfere with the binding reaction. In the alternative, an indirect detection procedure may use an agent that contains a label introduced either chemically or enzymatically. A desirable label generally does not interfere with binding or the stability of the resulting agent:polypeptide complex. However, the label is typically designed to be accessible to an antibody for an effective binding and hence generating a detectable signal.

A wide variety of labels suitable for detecting protein levels are known in the art. Non-limiting examples include radioisotopes, enzymes, colloidal metals, fluorescent compounds, bioluminescent compounds, and chemiluminescent compounds.

The amount of agent:polypeptide complexes formed during the binding reaction can be quantified by standard quantitative assays. As illustrated above, the formation of agent:polypeptide complex can be measured directly by the amount of label remaining at the site of binding. In an alternative, the protein associated with a signaling biochemical pathway is tested for its ability to compete with a labeled analog for binding sites on the specific agent. In this competitive assay, the amount of label captured is inversely proportional to the amount of protein sequences associated with a signaling biochemical pathway present in a test sample.

A number of techniques for protein analysis based on the general principles outlined above are available in the art. They include but are not limited to radioimmunoassays, ELISA (enzyme-linked immunosorbent assays), “sandwich” immunoassays, immunoradiometric assays, in situ immunoassays (using e.g., colloidal gold, enzyme or radioisotope labels), western blot analysis, immunoprecipitation assays, immunofluorescent assays, and SDS-PAGE.

Antibodies that specifically recognize or bind to proteins associated with a signaling biochemical pathway are preferable for conducting the aforementioned protein analyses. Where desired, antibodies that recognize a specific type of post-translational modifications (e.g., signaling biochemical pathway inducible modifications) can be used. Post-translational modifications include but are not limited to glycosylation, lipidation, acetylation, and phosphorylation. These antibodies may be purchased from commercial vendors. For example, anti-phosphotyrosine antibodies that specifically recognize tyrosine-phosphorylated proteins are available from a number of vendors including Invitrogen and Perkin Elmer. Anti-phosphotyrosine antibodies are particularly useful in detecting proteins that are differentially phosphorylated on their tyrosine residues in response to an ER stress. Such proteins include but are not limited to eukaryotic translation initiation factor 2 alpha (eIF-2α). Alternatively, these antibodies can be generated using conventional polyclonal or monoclonal antibody technologies by immunizing a host animal or an antibody-producing cell with a target protein that exhibits the desired post-translational modification.

In practicing a subject method, it may be desirable to discern the expression pattern of a protein associated with a signaling biochemical pathway in different bodily tissues, in different cell types, and/or in different subcellular structures. These studies can be performed with the use of tissue-specific, cell-specific, or subcellular structure-specific antibodies capable of binding to protein markers that are preferentially expressed in certain tissues, cell types, or subcellular structures.

An altered expression of a gene associated with a signaling biochemical pathway can also be determined by examining a change in activity of the gene product relative to a control cell. The assay for an agent-induced change in the activity of a protein associated with a signaling biochemical pathway will depend on the biological activity and/or the signal transduction pathway that is under investigation. For example, where the protein is a kinase, a change in its ability to phosphorylate the downstream substrate(s) can be determined by a variety of assays known in the art. Representative assays include but are not limited to immunoblotting and immunoprecipitation with antibodies such as anti-phosphotyrosine antibodies that recognize phosphorylated proteins. In addition, kinase activity can be detected by high throughput chemiluminescent assays such as AlphaScreen™ (available from Perkin Elmer) and eTag™ assay (Chan-Hui, et al. (2003) Clinical Immunology 111:162-174).

Where the protein associated with a signaling biochemical pathway is part of a signaling cascade leading to a fluctuation of intracellular pH condition, pH sensitive molecules such as fluorescent pH dyes can be used as the reporter molecules. In another example where the protein associated with a signaling biochemical pathway is an ion channel, fluctuations in membrane potential and/or intracellular ion concentration can be monitored. A number of commercial kits and high-throughput devices are particularly suited for a rapid and robust screening for modulators of ion channels. Representative instruments include FLIPR™ (Molecular Devices, Inc.) and VIPR (Aurora Biosciences). These instruments are capable of detecting reactions in over 1000 sample wells of a microplate simultaneously, and providing real-time measurement and functional data within a second or even a millisecond.

In practicing any of the methods disclosed herein, a suitable vector can be introduced to a cell, tissue, organism, or an embryo via one or more methods known in the art, including without limitation, microinjection, electroporation, sonoporation, biolistics, calcium phosphate-mediated transfection, cationic transfection, liposome transfection, dendrimer transfection, heat shock transfection, nucleofection transfection, magnetofection, lipofection, impalefection, optical transfection, proprietary agent-enhanced uptake of nucleic acids, and delivery via liposomes, immunoliposomes, virosomes, or artificial virions. In some methods, the vector is introduced into an embryo by microinjection. The vector or vectors may be microinjected into the nucleus or the cytoplasm of the embryo. In some methods, the vector or vectors may be introduced into a cell by nucleofection.

A target polynucleotide of a targetable nuclease complex can be any polynucleotide endogenous or exogenous to the host cell. For example, the target polynucleotide can be a polynucleotide residing in the nucleus of the eukaryotic cell, the genome of a prokaryotic cell, or an extrachromosomal vector of a host cell. The target polynucleotide can be a sequence coding a gene product (e.g., a protein) or a non-coding sequence (e.g., a regulatory polynucleotide or a junk DNA).

Examples of target polynucleotides include a sequence associated with a signaling biochemical pathway, e.g., a signaling biochemical pathway-associated gene or polynucleotide. Examples of target polynucleotides include a disease-associated gene or polynucleotide. A “disease-associated” gene or polynucleotide refers to any gene or polynucleotide which is yielding transcription or translation products at an abnormal level or in an abnormal form in cells derived from disease-affected tissues compared with tissues or cells of a non-disease control. It may be a gene that becomes expressed at an abnormally high level; it may be a gene that becomes expressed at an abnormally low level, where the altered expression correlates with the occurrence and/or progression of the disease. A disease-associated gene also refers to a gene possessing mutation(s) or genetic variation that is directly responsible or is in linkage disequilibrium with a gene(s) that is responsible for the etiology of a disease. The transcribed or translated products may be known or unknown, and may be at a normal or abnormal level.

Embodiments of the invention also relate to methods and compositions related to knocking out genes, editing genes, altering genes, amplifying genes, and repairing particular mutations. Altering genes may also mean the epigenetic manipulation of a target sequence. This may be the chromatin state of a target sequence, such as by modification of the methylation state of the target sequence (i.e., addition or removal of methylation or methylation patterns or CpG islands), histone modification, increasing or reducing accessibility to the target sequence, or by promoting 3D folding. It will be appreciated that where reference is made to a method of modifying a cell, organism, or mammal including human or a non-human mammal or organism by manipulation of a target sequence in a genomic locus of interest, this may apply to the organism (or mammal) as a whole or just a single cell or population of cells from that organism (if the organism is multicellular). In the case of humans, for instance, Applicants envisage, inter alia, a single cell or a population of cells and these may preferably be modified ex vivo and then re-introduced. In this case, a biopsy or other tissue or biological fluid sample may be necessary. Stem cells are also particularly preferred in this regard. But, of course, in vivo embodiments are also envisaged. And the invention is especially advantageous as to HSCs.

The functionality of a targetable nuclease complex can be assessed by any suitable assay. For example, the components of a targetable nuclease system sufficient to form a targetable nuclease complex, including a guide nucleic acid and nucleic acid-guided nuclease, can be provided to a host cell having the corresponding target sequence, such as by transfection with vectors encoding the components of the engineered nuclease system, followed by an assessment of preferential cleavage within the target sequence. Similarly, cleavage of a target sequence may be evaluated in a test tube by providing the target sequence and components of a targetable nuclease complex. Other assays are possible, and will occur to those skilled in the art. A guide sequence can be selected to target any target sequence. In some embodiments, the target sequence is a sequence within a genome of a cell. Exemplary target sequences include those that are unique in the target genome.

Editing Cassette

Disclosed herein are compositions and methods for editing a target polynucleotide sequence. Such compositions include polynucleotides containing one or more components of a targetable nuclease system. Polynucleotide sequences for use in these methods can be referred to as editing cassettes.

An editing cassette can comprise one or more primer sites. Primer sites can be used to amplify an editing cassette by using oligonucleotide primers comprising reverse complementary sequences that can hybridize to the one or more primer sites. An editing cassette can comprise two or more primer sites. Sometimes, an editing cassette comprises a primer site on each end of the editing cassette, said primer sites flanking one or more of the other components of the editing cassette. Primer sites can be approximately 10, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26 or more nucleotides in length.
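As a minimal sketch of how primers could be derived for a cassette that carries a primer site on each end, the following assumes the cassette is given as its top strand written 5′ to 3′ and that both primer sites have the same length. The function names and the default site length of 20 are illustrative assumptions, not part of the disclosure.

```python
# Illustrative sketch only: names and the default site length are assumptions.
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def primers_for_cassette(cassette: str, site_len: int = 20) -> tuple:
    """Return (forward, reverse) primers for a cassette flanked by primer sites.

    The forward primer matches the 5' primer site of the top strand, so it
    hybridizes to the bottom strand. The reverse primer is the reverse
    complement of the 3' primer site, so it hybridizes to the top strand.
    """
    if len(cassette) < 2 * site_len:
        raise ValueError("cassette is shorter than two primer sites")
    forward = cassette[:site_len]
    reverse = reverse_complement(cassette[-site_len:])
    return forward, reverse


if __name__ == "__main__":
    fwd, rev = primers_for_cassette("ATGC" + "TTTT" + "GGCA", site_len=4)
    print(fwd, rev)
```

Real primer design would also weigh melting temperature and secondary structure, which this sketch does not attempt.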

An editing cassette can comprise an editing template as disclosed herein. An editing cassette can comprise an editing sequence. An editing sequence can be homologous to a target sequence. An editing sequence can comprise at least one mutation relative to a target sequence. An editing sequence often comprises homology regions (or homology arms) flanking at least one mutation relative to a target sequence, such that the flanking homology regions facilitate homologous recombination of the editing sequence into a target sequence. An editing sequence can comprise an editing template as disclosed herein. For example, the editing sequence can comprise at least one mutation relative to a target sequence including one or more PAM mutations that mutate or delete a PAM site. An editing sequence can comprise one or more mutations in a codon or non-coding sequence relative to a non-editing target site.

A PAM mutation can be a silent mutation. A silent mutation can be a change to at least one nucleotide of a codon relative to the original codon that does not change the amino acid encoded by the original codon. A silent mutation can be a change to a nucleotide within a non-coding region, such as an intron, 5′ untranslated region, 3′ untranslated region, or other non-coding region.

A PAM mutation can be a non-silent mutation. Non-silent mutations can include a missense mutation. A missense mutation can be a change to at least one nucleotide of a codon relative to the original codon that changes the amino acid encoded by the original codon. Missense mutations can occur within an exon, open reading frame, or other coding region.
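The distinction between silent and missense codon changes can be illustrated with a short sketch. It assumes the standard genetic code and DNA codons written with A, C, G, and T. The function name is hypothetical, and the "nonsense" label for changes that introduce a stop codon is added here for completeness rather than taken from the text above.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def classify_codon_change(original: str, mutated: str) -> str:
    """Classify a codon change as 'silent', 'missense', or 'nonsense'."""
    aa_before = CODON_TABLE[original.upper()]
    aa_after = CODON_TABLE[mutated.upper()]
    if aa_before == aa_after:
        return "silent"      # encoded amino acid is unchanged
    if aa_after == "*":
        return "nonsense"    # change introduces a stop codon
    return "missense"        # encoded amino acid is changed


if __name__ == "__main__":
    print(classify_codon_change("CTG", "CTT"))
    print(classify_codon_change("GAA", "GTA"))
```

A check of this kind could be used when choosing a PAM mutation that disrupts the PAM site without altering the encoded protein.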

An editing sequence can comprise at least one mutation relative to a target sequence. A mutation can be a silent mutation or non-silent mutation, such as a missense mutation. A mutation can include an insertion of one or more nucleotides or base pairs. A mutation can include a deletion of one or more nucleotides or base pairs. A mutation can include a substitution of one or more nucleotides or base pairs for a different one or more nucleotides or base pairs. Inserted or substituted sequences can include exogenous or heterologous sequences.

An editing cassette can comprise a polynucleotide encoding a guide nucleic acid sequence. In some cases, the guide nucleic acid sequence is optionally operably linked to a promoter. A guide nucleic acid sequence can comprise a scaffold sequence and a guide sequence as described herein.

An editing cassette can comprise a barcode. A barcode can be a unique DNA sequence that corresponds to the editing sequence such that the barcode can identify the one or more mutations of the corresponding editing sequence. In some examples, the barcode is 15 nucleotides. The barcode can comprise less than 10, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 88, 90, 100, 110, 120, 130, 140, 150, 160, 170, 180, 190, 200, or more than 200 nucleotides. A barcode can be a non-naturally occurring sequence. An editing cassette comprising a barcode can be a non-naturally occurring sequence.

An editing cassette can comprise one or more of an editing sequence and a polynucleotide encoding a guide nucleic acid optionally operably linked to a promoter, wherein the editing cassette and guide nucleic acid sequence are flanked by primer sites. An editing cassette can further comprise a barcode.
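The composition described above can be sketched as a simple data structure. The class name, field names, and the order in which the elements are concatenated are assumptions made for illustration; the disclosure does not fix a particular element order beyond the primer sites flanking the other components.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class EditingCassette:
    """Hypothetical container for the elements of an editing cassette."""

    forward_primer_site: str
    editing_sequence: str
    guide_coding_sequence: str
    reverse_primer_site: str
    promoter: str = ""  # optional promoter operably linked to the guide
    barcode: str = ""   # optional barcode identifying the edit

    def sequence(self) -> str:
        """Concatenate the elements, with primer sites on the outside.

        The internal order (edit, promoter, guide, barcode) is an
        assumption chosen for this sketch.
        """
        parts = [
            self.forward_primer_site,
            self.editing_sequence,
            self.promoter,
            self.guide_coding_sequence,
            self.barcode,
            self.reverse_primer_site,
        ]
        return "".join(parts)


if __name__ == "__main__":
    cassette = EditingCassette(
        "AAAA", "CCCC", "GGGG", "TTTT", promoter="TATA", barcode="ACGT"
    )
    print(cassette.sequence())
```

Keeping the optional elements as empty strings by default lets the same structure describe cassettes with or without a promoter or barcode.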

An example of an editing cassette is depicted in FIG. 3. Each editing cassette can be designed to edit a site in a target sequence. Sites to be targeted can be coding regions, non-coding regions, functionally neutral sites, or they can be a screenable or selectable marker gene. Homology regions within the editing sequence flank the one or more mutations of the editing cassette and can be inserted into the target sequence by recombination. Recombination can comprise DNA cleavage, such as by a nucleic acid-guided nuclease, and repair via homologous recombination.

Editing cassettes can be generated by chemical synthesis, Gibson assembly, SLIC, CPEC, PCA, ligation-free cloning, overlapping oligo extension, in vitro assembly, in vitro oligo assembly, PCR, traditional ligation-based cloning, other known methods in the art, or any combination thereof.

Trackable sequences, such as barcodes or recorder sequences, can be designed in silico via standard code with a degenerate mutation at the target codon. The degenerate mutation can comprise 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, or more than 30 nucleic acid residues. In some examples, the degenerate mutations can comprise 15 nucleic acid residues (N15).
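One way to picture an in silico N15 design is to draw distinct random 15-mers over A, C, G, and T. The sketch below does only that; the function name, the fixed seed, and the uniform sampling are assumptions, and a practical design would likely add filters (for example, for homopolymer runs or unwanted restriction sites) that are not shown.

```python
import random


def random_barcodes(count: int, length: int = 15, seed: int = 0) -> list:
    """Return `count` distinct random barcodes of the given length.

    A seeded generator is used so the same call reproduces the same set.
    """
    if count > 4 ** length:
        raise ValueError("more barcodes requested than distinct sequences exist")
    rng = random.Random(seed)
    barcodes = set()
    while len(barcodes) < count:
        barcodes.add("".join(rng.choice("ACGT") for _ in range(length)))
    return sorted(barcodes)


if __name__ == "__main__":
    for barcode in random_barcodes(5):
        print(barcode)
```

An N15 position set admits 4^15 (about 1.07 × 10⁹) distinct sequences, so collisions between randomly drawn barcodes are rare at typical library sizes.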

Homology arms can be added to an editing sequence to allow incorporation of the editing sequence into the desired location via homologous recombination or homology-driven repair. Homology arms can be added by synthesis, in vitro assembly, PCR, or other known methods in the art. For example, homology arms can be added by chemical synthesis, Gibson assembly, SLIC, CPEC, PCA, ligation-free cloning, overlapping oligo extension, in vitro assembly, in vitro oligo assembly, PCR, traditional ligation-based cloning, other known methods in the art, or any combination thereof. A homology arm can be added to both ends of a barcode, recorder sequence, and/or editing sequence, thereby flanking the sequence with two distinct homology arms, for example, a 5′ homology arm and a 3′ homology arm.

A homology arm can comprise sequence homologous to a target sequence. A homology arm can comprise sequence homologous to sequence adjacent to a target sequence. A homology arm can comprise sequence homologous to sequence upstream or downstream of a target sequence. A homology arm can comprise sequence homologous to sequence within the same gene or open reading frame as a target sequence. A homology arm can comprise sequence homologous to sequence upstream or downstream of a gene or open reading frame the target sequence is within. A homology arm can comprise sequence homologous to a 5′ UTR or 3′ UTR of a gene or open reading frame within which is a target sequence. A homology arm can comprise sequence homologous to a different gene, open reading frame, promoter, terminator, or nucleic acid sequence than that which the target sequence is within.

The same 5′ and 3′ homology arms can be added to a plurality of distinct editing sequences, thereby generating a library of unique editing sequences that each have the same targeted insertion site. The same 5′ and 3′ homology arms can be added to a plurality of distinct editing templates, thereby generating a library of unique editing templates that each have the same targeted insertion site. In alternative examples, different or a variety of 5′ or 3′ homology arms can be added to a plurality of editing sequences or editing templates.
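The idea of a library sharing one pair of homology arms can be sketched as follows. The sketch assumes the arms are taken directly from the target sequence on either side of the region being replaced; the function names, coordinates (0-based, end-exclusive), and arm length are illustrative choices rather than values from the disclosure.

```python
def design_editing_sequence(target: str, start: int, end: int,
                            replacement: str, arm_len: int) -> str:
    """Flank `replacement` with 5' and 3' homology arms cut from `target`.

    target[start:end] is the region to be replaced.
    """
    if start - arm_len < 0 or end + arm_len > len(target):
        raise ValueError("homology arms would run past the target sequence")
    arm_5 = target[start - arm_len:start]  # 5' homology arm
    arm_3 = target[end:end + arm_len]      # 3' homology arm
    return arm_5 + replacement + arm_3


def design_library(target: str, start: int, end: int,
                   replacements: list, arm_len: int) -> list:
    """Build editing sequences that all share the same insertion site."""
    return [
        design_editing_sequence(target, start, end, r, arm_len)
        for r in replacements
    ]


if __name__ == "__main__":
    target = "AAAACCCCGGGGTTTT"
    for seq in design_library(target, 6, 9, ["TAA", "GCT"], arm_len=4):
        print(seq)
```

Because every member is built from the same coordinates, each editing sequence in the resulting list carries identical arms and differs only in the replacement it delivers.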

A barcode library or recorder sequence library comprising flanking homology arms can be cloned into a vector backbone. In some examples, the barcodes comprising flanking homology arms are cloned into an editing cassette. Cloning can occur by chemical synthesis, Gibson assembly, SLIC, CPEC, PCA, ligation-free cloning, overlapping oligo extension, in vitro assembly, in vitro oligo assembly, PCR, traditional ligation-based cloning, other known methods in the art, or any combination thereof.

An editing sequence library comprising flanking homology arms can be cloned into a vector backbone. In some examples, the editing sequence and homology arms are cloned into an editing cassette. Editing cassettes can, in some cases, further comprise a nucleic acid sequence encoding a guide nucleic acid or gRNA engineered to target the desired site of editing sequence insertion, e.g., the target sequence. Editing cassettes can, in some cases, further comprise a barcode or recorder sequence. Cloning can occur by chemical synthesis, Gibson assembly, SLIC, CPEC, PCA, ligation-free cloning, overlapping oligo extension, in vitro assembly, in vitro oligo assembly, PCR, traditional ligation-based cloning, other known methods in the art, or any combination thereof.

Gene-wide or genome-wide editing libraries can be cloned into a vector backbone. A barcode or recorder sequence library can be inserted or assembled into a second site to generate competent trackable plasmids that can embed the recording barcode at a fixed locus while integrating the editing libraries at a wide variety of user-defined sites. Cloning can occur by chemical synthesis, Gibson assembly, SLIC, CPEC, PCA, ligation-free cloning, overlapping oligo extension, in vitro assembly, in vitro oligo assembly, PCR, traditional ligation-based cloning, other known methods in the art, or any combination thereof.

A guide nucleic acid or sequence encoding the same can be assembled or inserted into a vector backbone first, followed by insertion of an editing sequence and/or cassette. In other cases, an editing sequence and/or cassette can be inserted or assembled into a vector backbone first, followed by insertion of a guide nucleic acid or sequence encoding the same. In other cases, a guide nucleic acid or sequence encoding the same and an editing sequence and/or cassette are simultaneously inserted or assembled into a vector. A recorder sequence or barcode can be inserted before or after any of these steps. In other words, it should be understood that there are many possible permutations to the order in which elements of the disclosure are assembled. The vector can be linear or circular and can be generated by chemical synthesis, Gibson assembly, SLIC, CPEC, PCA, ligation-free cloning, overlapping oligo extension, in vitro assembly, in vitro oligo assembly, PCR, traditional ligation-based cloning, other known methods in the art, or any combination thereof.

A nucleic acid molecule can be synthesized which comprises one or more elements disclosed herein. A nucleic acid molecule can be synthesized that comprises an editing cassette. A nucleic acid molecule can be synthesized that comprises a guide nucleic acid. A nucleic acid molecule can be synthesized that comprises a recorder cassette. A nucleic acid molecule can be synthesized that comprises a barcode. A nucleic acid molecule can be synthesized that comprises a homology arm. A nucleic acid molecule can be synthesized that comprises an editing cassette and a guide nucleic acid. A nucleic acid molecule can be synthesized that comprises an editing cassette and a barcode. A nucleic acid molecule can be synthesized that comprises an editing cassette, a guide nucleic acid, and a recorder cassette. A nucleic acid molecule can be synthesized that comprises an editing cassette, a recorder cassette, and two guide nucleic acids. A nucleic acid molecule can be synthesized that comprises a recorder cassette and a guide nucleic acid. In any of these cases, the guide nucleic acid can optionally be operably linked to a promoter. In any of these cases, the nucleic acid molecule can further include one or more barcodes.

Synthesis can occur by any nucleic acid synthesis method known in the art. Synthesis can occur by enzymatic nucleic acid synthesis. Synthesis can occur by chemical synthesis. Synthesis can occur by array-based synthesis. Synthesis can occur by solid-phase synthesis or phosphoramidite methods. Synthesis can occur by column or multi-well methods. Synthesized nucleic acid molecules can be non-naturally occurring nucleic acid molecules.

Software and automation methods can be used for multiplex synthesis and generation. For example, software and automation can be used to create 10, 10², 10³, 10⁴, 10⁵, 10⁶, or more synthesized polynucleotides, cassettes, or plasmids. An automation method can generate desired sequences and libraries in rapid fashion that can be processed through a workflow with minimal steps to produce precisely defined libraries, such as gene-wide or genome-wide editing libraries.

Polynucleotides or libraries can be generated which comprise two or more nucleic acid molecules or plasmids comprising any combination disclosed herein of recorder sequence, editing sequence, guide nucleic acid, and optional barcode, including combinations of one or more of any of the previously mentioned elements. For example, such a library can comprise at least 2, 3, 4, 5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 200, 300, 400, 500, 600, 700, 800, 1000, 1500, 2000, 2500, 3000, 3500, 4000, 4500, 5000, 5500, 6000, 6500, 7000, 7500, 8000, 8500, 9000, 9500, 10⁴, 10⁵, 10⁶, 10⁷, 10⁸, 10⁹, 10¹⁰, or more nucleic acid molecules or plasmids of the present disclosure. It should be understood that such a library can include any number of nucleic acid molecules or plasmids, even if the specific number is not explicitly listed above.

Trackable plasmid libraries or nucleic acid molecule libraries can be sequenced in order to determine the recorder sequence and editing sequence pair that is comprised on each trackable plasmid. In other cases, a known recorder sequence is paired with a known editing sequence during the library generation process. Other methods of determining the association between a recorder sequence and editing sequence comprised on a common nucleic acid molecule or plasmid are envisioned such that the editing sequence can be identified by identification or sequencing of the recorder sequence.
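Once recorder and editing sequence pairs are known, identifying an edit from its recorder reduces to a lookup. The following sketch assumes the pairs have already been determined (for example, by sequencing the plasmid library) and that recorder sequences are later read out from a population of cells. The function names, the example recorder sequences, and the edit labels are hypothetical.

```python
from collections import Counter


def build_recorder_map(pairs):
    """Map each recorder sequence to the edit it was paired with.

    Raises ValueError if one recorder is paired with two different edits,
    since the recorder could then no longer identify the edit.
    """
    mapping = {}
    for recorder, edit in pairs:
        if recorder in mapping and mapping[recorder] != edit:
            raise ValueError(f"recorder {recorder} maps to more than one edit")
        mapping[recorder] = edit
    return mapping


def tally_edits(observed_recorders, mapping):
    """Count edits inferred from observed recorder sequences."""
    counts = Counter()
    for recorder in observed_recorders:
        counts[mapping.get(recorder, "unmapped")] += 1
    return counts


if __name__ == "__main__":
    mapping = build_recorder_map([("AAAC", "edit_1"), ("AAAG", "edit_2")])
    print(tally_edits(["AAAC", "AAAC", "AAAG", "TTTT"], mapping))
```

Recorder reads with no entry in the map are counted as "unmapped" rather than discarded, so sequencing errors or unexpected recorders remain visible in the tally.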

Methods and compositions for tracking edited episomal libraries that are shuttled between E. coli and other organisms/cell lines are provided herein. The libraries can be comprised on plasmids, bacterial artificial chromosomes (BACs), yeast artificial chromosomes (YACs), synthetic chromosomes, or viral or phage genomes. These methods and compositions can be used to generate portable barcoded libraries in host organisms, such as E. coli. Library generation in such organisms can offer the advantage of established techniques for performing homologous recombination. Barcoded plasmid libraries can be deep-sequenced at one site to track mutational diversity targeted across the remaining portions of the plasmid, allowing dramatic improvements in the depth of library coverage.

Any nucleic acid molecule disclosed herein can be an isolated nucleic acid. Isolated nucleic acids may be made by any method known in the art, for example using standard recombinant methods, assembly methods, synthesis techniques, or combinations thereof. In some embodiments, the nucleic acids may be cloned, amplified, assembled, or otherwise constructed.

Isolated nucleic acids may be obtained from cellular, bacterial, or other sources using any number of cloning methodologies known in the art. In some embodiments, oligonucleotide probes which selectively hybridize, under stringent conditions, to other oligonucleotides or to the nucleic acids of an organism or cell can be used to isolate or identify an isolated nucleic acid.

Cellular genomic DNA, RNA, or cDNA may be screened for the presence of an identified genetic element of interest using a probe based upon one or more sequences. Various degrees of stringency of hybridization may be employed in the assay.

High stringency conditions for nucleic acid hybridization are well known in the art. For example, conditions may comprise low salt and/or high temperature conditions, such as provided by about 0.02 M to about 0.15 M NaCl at temperatures of about 50° C. to about 70° C. It is understood that the temperature and ionic strength of a desired stringency are determined in part by the length of the particular nucleic acid(s), the length and nucleotide content of the target sequence(s), the charge composition of the nucleic acid(s), and by the presence or concentration of formamide, tetramethylammonium chloride or other solvent(s) in a hybridization mixture. Nucleic acids may be completely complementary to a target sequence or may exhibit one or more mismatches.

Nucleic acids of interest may also be amplified using a variety of known amplification techniques. For instance, polymerase chain reaction (PCR) technology may be used to amplify target sequences directly from DNA, RNA, or cDNA. PCR and other in vitro amplification methods may also be useful, for example, to clone nucleic acid sequences, to make nucleic acids to use as probes for detecting the presence of a target nucleic acid in samples, for nucleic acid sequencing, or for other purposes.

Isolated nucleic acids may be prepared by direct chemical synthesis by methods such as the phosphotriester method, or using an automated synthesizer. Chemical synthesis generally produces a single-stranded oligonucleotide. This may be converted into double-stranded DNA by hybridization with a complementary sequence or by polymerization with a DNA polymerase using the single strand as a template.

Recorder

In some examples, two editing cassettes can be used together to track a genetic engineering step. For example, one editing cassette can comprise an editing template and an encoded guide nucleic acid, and a second editing cassette, referred to as a recorder cassette, can comprise an editing template comprising a recorder sequence and an encoded nucleic acid which has a distinct guide sequence compared to that of the first editing cassette. In such cases, the editing sequence and the recorder sequence can be inserted into separate target sequences and determined by their corresponding guide nucleic acids. A recorder sequence can comprise a barcode, trackable or traceable sequence, and/or a regulatory element operable with a screenable or selectable marker.

Through a multiplexed cloning approach, the recorder cassette can be covalently coupled to at least one editing cassette in a plasmid (e.g., FIG. 17A, green cassette) to generate plasmid libraries that have a unique recorder and editing cassette combination. This library can be sequenced to generate the recorder/edit mapping and used to track editing libraries across large segments of the target DNA (e.g., FIG. 17C). Recorder and editing sequences can be comprised on the same cassette, in which case they are both incorporated into the target nucleic acid sequence, such as a genome or plasmid, by the same recombination event. In other examples, the recorder and editing sequences can be comprised on separate cassettes within the same plasmid, in which case the recorder and editing sequences are incorporated into the target nucleic acid sequence by separate recombination events, either simultaneously or sequentially.

Methods are provided herein for combining multiplex oligonucleotide synthesis with recombineering, to create libraries of specifically designed and trackable mutations. Screens and/or selections followed by high-throughput sequencing and/or barcode microarray methods can allow for rapid mapping of mutations leading to a phenotype of interest.

Methods and compositions disclosed herein can be used to simultaneously engineer and track engineering events in a target nucleic acid sequence.

Such plasmids can be generated using in vitro assembly or cloning techniques. For example, the plasmids can be generated using chemical synthesis, Gibson assembly, SLIC, CPEC, PCA, ligation-free cloning, other in vitro oligo assembly techniques, traditional ligation-based cloning, or any combination thereof.

Such plasmids can comprise at least one recording sequence, such as a barcode, and at least one editing sequence. In most cases, the recording sequence is used to record and track engineering events. Each editing sequence can be used to incorporate a desired edit into a target nucleic acid sequence. The desired edit can include insertion, deletion, substitution, or alteration of the target nucleic acid sequence. In some examples, the one or more recording sequences and editing sequences are comprised on a single cassette comprised within the plasmid such that they are incorporated into the target nucleic acid sequence by the same engineering event. In other examples, the recording and editing sequences are comprised on separate cassettes within the plasmid such that they are each incorporated into the target nucleic acid by distinct engineering events. In some examples, the plasmid comprises two or more editing sequences. For example, one editing sequence can be used to alter or silence a PAM sequence while a second editing sequence can be used to incorporate a mutation into a distinct sequence.

Recorder sequences can be inserted into a site separated from the editing sequence insertion site. The inserted recorder sequence can be separated from the editing sequence by 1 bp to 1 Mbp. For example, the separation distance can be about 1 bp, 10 bp, 50 bp, 100 bp, 500 bp, 1 kb, 2 kb, 5 kb, 10 kb, or greater. The separation distance can be any discrete integer between 1 bp and 10 Mbp. In some examples, the maximum distance of separation depends on the size of the target nucleic acid or genome.

Recorder sequences can be inserted adjacent to editing sequences, or within proximity to the editing sequence. For example, the recorder sequence can be inserted outside of the open reading frame within which the editing sequence is inserted. The recorder sequence can be inserted into an untranslated region adjacent to an open reading frame within which an editing sequence has been inserted. The recorder sequence can be inserted into a functionally neutral or non-functional site. The recorder sequence can be inserted into a screenable or selectable marker gene.

In some examples, the target nucleic acid sequence is comprised within a genome, artificial chromosome, synthetic chromosome, or episomal plasmid. In various examples, the target nucleic acid sequence can be in vitro or in vivo. When the target nucleic acid sequence is in vivo, the plasmid can be introduced into the host organisms by transformation, transfection, conjugation, biolistics, nanoparticles, cell-permeable technologies, or other known methods for DNA delivery, or any combination thereof. In such examples, the host organism can be a eukaryote, prokaryote, bacterium, archaea, yeast, or other fungi.

The engineering event can comprise recombineering, non-homologous end joining, homologous recombination, or homology-driven repair. In some examples, the engineering event is performed in vitro or in vivo.

The methods described herein can be carried out in any type of cell in which a targetable nuclease system can function (e.g., target and cleave DNA), including prokaryotic and eukaryotic cells. In some embodiments, the cell is a bacterial cell, such as Escherichia spp. (e.g., E. coli). In other embodiments, the cell is a fungal cell, such as a yeast cell, e.g., Saccharomyces spp. In other embodiments, the cell is an algal cell, a plant cell, an insect cell, or a mammalian cell, including a human cell.

In some examples, the cell is a recombinant organism. For example, the cell can comprise a non-native targetable nuclease system. Additionally or alternatively, the cell can comprise recombination system machinery. Such recombination systems can include lambda red recombination system, Cre/Lox, attB/attP, or other integrase systems. Where appropriate, the plasmid can have the complementary components or machinery required for the selected recombination system to work correctly and efficiently.

A method for genome editing can comprise: (a) introducing a vector that encodes at least one editing cassette and at least one guide nucleic acid into a first population of cells, thereby producing a second population of cells comprising the vector; (b) maintaining the second population of cells under conditions in which a nucleic acid-guided nuclease is expressed or maintained, wherein the nucleic acid-guided nuclease is encoded on the vector, a second vector, on the genome of cells of the second population of cells, or otherwise introduced into the cell, resulting in DNA cleavage and incorporation of the editing cassette; (c) obtaining viable cells; and (d) sequencing the target DNA molecule in at least one cell of the second population of cells to identify the mutation of at least one codon.

A method for genome editing can comprise: (a) introducing a vector that encodes at least one editing cassette comprising a PAM mutation as disclosed herein and at least one guide nucleic acid into a first population of cells, thereby producing a second population of cells comprising the vector; (b) maintaining the second population of cells under conditions in which a nucleic acid-guided nuclease is expressed or maintained, wherein the nucleic acid-guided nuclease is encoded on the vector, a second vector, on the genome of cells of the second population of cells, or otherwise introduced into the cell, resulting in DNA cleavage, incorporation of the editing cassette, and death of cells of the second population of cells that do not comprise the PAM mutation, whereas cells of the second population of cells that comprise the PAM mutation are viable; (c) obtaining viable cells; and (d) sequencing the target DNA in at least one cell of the second population of cells to identify the mutation of at least one codon.

A method for trackable genome editing can comprise: (a) introducing a vector that encodes at least one editing cassette, at least one recorder cassette, and at least two guide nucleic acids into a first population of cells, thereby producing a second population of cells comprising the vector; (b) maintaining the second population of cells under conditions in which a nucleic acid-guided nuclease is expressed or maintained, wherein the nucleic acid-guided nuclease is encoded on the vector, a second vector, on the genome of cells of the second population of cells, or otherwise introduced into the cell, resulting in DNA cleavage and incorporation of the editing and recorder cassettes; (c) obtaining viable cells; and (d) sequencing the recorder sequence of the target DNA molecule in at least one cell of the second population of cells to identify the mutation of at least one codon.

In some examples where the plasmid comprises a second editing sequence designed to silence a PAM, a method for trackable genome editing can comprise: (a) introducing a vector that encodes at least one editing cassette, a recorder cassette, and at least two guide nucleic acids into a first population of cells, thereby producing a second population of cells comprising the vector; (b) maintaining the second population of cells under conditions in which a nucleic acid-guided nuclease is expressed or maintained, wherein the nucleic acid-guided nuclease is encoded on the vector, a second vector, on the genome of cells of the second population of cells, or otherwise introduced into the cell, resulting in DNA cleavage, incorporation of the editing and recorder cassettes, and death of cells of the second population of cells that do not comprise the PAM mutation, whereas cells of the second population of cells that comprise the PAM mutation are viable; (c) obtaining viable cells; and (d) sequencing the recorder sequence of the target DNA in at least one cell of the second population of cells to identify the mutation of at least one codon.

In some examples, transformation efficiency is determined by using a non-targeting control guide nucleic acid, which allows for validation of the recombineering procedure and CFU/ng calculations. In some cases, absolute efficiency is obtained by counting the total number of colonies on each transformation plate, for example, by counting both red and white colonies from a galK control. In some examples, relative efficiency is calculated as the total number of successful transformants (for example, white colonies) out of all colonies from a control (for example, galK control).
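By way of non-limiting illustration, the two efficiency figures described above can be sketched in Python. The function names, the `fraction_plated` parameter, and the colony counts in the usage note are assumptions introduced for illustration only and are not taken from the disclosure.

```python
def cfu_per_ng(total_colonies: int, dna_ng: float, fraction_plated: float = 1.0) -> float:
    """Absolute efficiency: colony-forming units per ng of transformed DNA.

    total_colonies counts every colony on the plate (e.g., red plus white
    colonies from a galK control). fraction_plated is a hypothetical
    correction for plating only part of the recovery culture.
    """
    if dna_ng <= 0:
        raise ValueError("dna_ng must be positive")
    if not 0 < fraction_plated <= 1:
        raise ValueError("fraction_plated must be in (0, 1]")
    return total_colonies / (dna_ng * fraction_plated)


def relative_efficiency(white_colonies: int, red_colonies: int) -> float:
    """Relative efficiency: successful transformants (white) out of all colonies."""
    total = white_colonies + red_colonies
    if total == 0:
        raise ValueError("no colonies counted")
    return white_colonies / total
```

For instance, a hypothetical plate with 150 white and 50 red colonies obtained from 10 ng of DNA would give `cfu_per_ng(200, 10.0)` and `relative_efficiency(150, 50)`.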

The methods of the disclosure can provide, for example, greater than 1000× improvements in the efficiency, scale, cost of generating a combinatorial library, and/or precision of such library generation.

The methods of the disclosure can provide, for example, greater than: 10×, 50×, 100×, 200×, 300×, 400×, 500×, 600×, 700×, 800×, 900×, 1000×, 1100×, 1200×, 1300×, 1400×, 1500×, 1600×, 1700×, 1800×, 1900×, 2000×, or greater improvements in the efficiency of generating genomic or combinatorial libraries.

The methods of the disclosure can provide, for example, greater than: 10×, 50×, 100×, 200×, 300×, 400×, 500×, 600×, 700×, 800×, 900×, 1000×, 1100×, 1200×, 1300×, 1400×, 1500×, 1600×, 1700×, 1800×, 1900×, 2000×, or greater improvements in the scale of generating genomic or combinatorial libraries.

The methods of the disclosure can provide, for example, greater than: 10×, 50×, 100×, 200×, 300×, 400×, 500×, 600×, 700×, 800×, 900×, 1000×, 1100×, 1200×, 1300×, 1400×, 1500×, 1600×, 1700×, 1800×, 1900×, 2000×, or greater decrease in the cost of generating genomic or combinatorial libraries.

The methods of the disclosure can provide, for example, greater than: 10×, 50×, 100×, 200×, 300×, 400×, 500×, 600×, 700×, 800×, 900×, 1000×, 1100×, 1200×, 1300×, 1400×, 1500×, 1600×, 1700×, 1800×, 1900×, 2000×, or greater improvements in the precision of genomic or combinatorial library generation.

Recursive Tracking for Combinatorial Engineering

Disclosed herein are methods and compositions for iterative rounds of engineering. Disclosed herein are recursive engineering strategies that allow implementation of CREATE recording at the single cell level through several serial engineering cycles (e.g., FIG. 18 and FIG. 19). These disclosed methods and compositions can enable search-based technologies that can effectively construct and explore complex genotypic space. The terms recursive and iterative can be used interchangeably.

Combinatorial engineering methods can comprise multiple rounds of engineering. Methods disclosed herein can comprise 2 or more rounds of engineering. For example, a method can comprise 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 20, 25, 30, or more than 30 rounds of engineering.

In some examples, during each round of engineering a new recorder sequence, such as a barcode, is incorporated at the same locus in nearby sites (e.g., FIG. 18, green bars or FIG. 19, black bars) such that, following multiple engineering cycles to construct combinatorial diversity throughout the genome (e.g., FIG. 18, green bars or FIG. 19, grey bars), a simple PCR of the recording locus can be used to reconstruct each combinatorial genotype or to confirm that the engineered edit from each round has been incorporated into the target site.
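As a minimal sketch of how a sequenced recording locus could be decoded into a combinatorial genotype, the Python example below assumes that barcodes are of fixed length, are inserted directly adjacent to one another, and appear in round order. Those assumptions, the function name, and the placeholder edit labels are hypothetical and are not specified by the disclosure.

```python
from typing import Dict, List, Tuple


def decode_recorder_array(
    read: str, barcode_len: int, round_maps: List[Dict[str, str]]
) -> Tuple[str, ...]:
    """Split a recorder-locus read into per-round barcodes and look up each edit.

    round_maps[i] maps each barcode used in round i to a label for the edit
    that was paired with it in that round (hypothetical bookkeeping).
    """
    expected = barcode_len * len(round_maps)
    if len(read) != expected:
        raise ValueError(f"expected a read of {expected} nt, got {len(read)}")
    edits = []
    for i, lookup in enumerate(round_maps):
        barcode = read[i * barcode_len:(i + 1) * barcode_len]
        if barcode not in lookup:
            raise KeyError(f"unknown barcode {barcode!r} in round {i + 1}")
        edits.append(lookup[barcode])
    return tuple(edits)


# Placeholder barcode-to-edit tables for two rounds of engineering.
ROUND_MAPS = [
    {"ACGT": "edit_A", "TTAG": "edit_B"},
    {"GGCA": "edit_C", "CATG": "edit_D"},
]
genotype = decode_recorder_array("TTAGGGCA", 4, ROUND_MAPS)
```

Counting the decoded tuples across many reads (for example, with `collections.Counter`) would give genotype frequencies for a population.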

Disclosed herein are methods for selecting for successive rounds of engineering. Selection can occur by a PAM mutation incorporated by an editing cassette. Selection can occur by a PAM mutation incorporated by a recorder cassette. Selection can occur using a screenable, selectable, or counter-selectable marker. Selection can occur by targeting a site for editing or recording that was incorporated by a prior round of engineering, thereby selecting for variants that successfully incorporated edits and recorder sequences from both rounds or all prior rounds of engineering.

Quantitation of these genotypes can be used for understanding combinatorial mutational effects on large populations and investigation of important biological phenomena such as epistasis.

Serial editing and combinatorial tracking can be implemented using recursive vector systems as disclosed herein. These recursive vector systems can be used to move rapidly through the transformation procedure. In some examples, these systems consist of two or more plasmids containing orthogonal replication origins, antibiotic markers, and an encoded guide nucleic acid. The encoded guide nucleic acid in each vector can be designed to target one of the other resistance markers for destruction by nucleic acid-guided nuclease-mediated cleavage. These systems can be used, in some examples, to perform transformations in which the antibiotic selection pressure is switched to remove the previous plasmid and drive enrichment of the next round of engineered genomes. Two or more passages through the transformation loop can be performed, or in other words, multiple rounds of engineering can be performed. Introducing the requisite recording cassettes and editing cassettes into recursive vectors as disclosed herein can be used for simultaneous genome editing and plasmid curing in each transformation step with high efficiencies.

In some examples, the recursive vector system disclosed herein comprises 2, 3, 4, 5, 6, 7, 8, 9, 10, or more than 10 unique plasmids. In some examples, the recursive vector system can use a particular plasmid more than once as long as a distinct plasmid is used in the previous round and in the subsequent round.
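The plasmid-rotation logic described above can be illustrated with a short Python sketch. It checks that no plasmid is used in two consecutive rounds and that the guide nucleic acid on each incoming plasmid targets the resistance marker of the plasmid from the previous round, then returns the antibiotic selection to apply in each round. The class, the function, and the plasmid and marker names are hypothetical placeholders, not components named in the disclosure.

```python
from dataclasses import dataclass
from typing import List, Sequence


@dataclass(frozen=True)
class Plasmid:
    name: str
    marker: str        # antibiotic resistance marker carried by this plasmid
    guide_target: str  # resistance marker targeted by this plasmid's guide


def selection_schedule(rounds: Sequence[Plasmid]) -> List[str]:
    """Validate a round-by-round plasmid order and return the selection per round."""
    for previous, current in zip(rounds, rounds[1:]):
        if previous.name == current.name:
            raise ValueError(f"{current.name} is used in two consecutive rounds")
        if current.guide_target != previous.marker:
            raise ValueError(
                f"{current.name} does not target the marker of {previous.name}"
            )
    return [plasmid.marker for plasmid in rounds]


# Two placeholder plasmids whose guides target each other's markers.
p1 = Plasmid(name="plasmid_1", marker="marker_X", guide_target="marker_Y")
p2 = Plasmid(name="plasmid_2", marker="marker_Y", guide_target="marker_X")
schedule = selection_schedule([p1, p2, p1, p2])
```

With two mutually targeting plasmids, the same pair can be alternated for any number of rounds, which matches the reuse condition stated above.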

Recursive methods and compositions disclosed herein can be used to restore function to a selectable or screenable element in a targeted genome or plasmid. The selectable or screenable element can include an antibiotic resistance gene, a fluorescent gene, a unique DNA sequence or watermark, or other known reporter, screenable, or selectable gene. In some examples, each successive round of engineering can incorporate a fragment of the selectable or screenable element, such that at the end of the engineering rounds, the entire selectable or screenable element has been incorporated into the target genome or plasmid. In such examples, only those genomes or plasmids which have successfully incorporated all of the fragments, and therefore all of the desired corresponding mutations, can be selected or screened for. In this way, the selected or screened cells will be enriched for those that have incorporated the edits from each and every iterative round of engineering.

Recursive methods can be used to switch a selectable or screenable marker between an on and an off position, or between an off and an on position, with each successive round of engineering. Using such a method allows conservation of available selectable or screenable markers by requiring, for example, the use of only one screenable or selectable marker. Furthermore, short regulatory sequences, start codons, or non-start codons can be used to turn the screenable or selectable marker on and off. Such short sequences can easily fit within a synthesized cassette or polynucleotide.

One or more rounds of engineering can be performed using the methods and compositions disclosed herein. In some examples, each round of engineering is used to incorporate an edit unique from that of previous rounds. Each round of engineering can incorporate a unique recording sequence. Each round of engineering can result in removal or curing of the plasmid used in the previous round of engineering. In some examples, successful incorporation of the recording sequence of each round of engineering results in a complete and functional screenable or selectable marker or unique sequence combination.

Unique recorder cassettes comprising recording sequences such as barcodes or screenable or selectable markers can be inserted with each round of engineering, thereby generating a recorder sequence that is indicative of the combination of edits or engineering steps performed. Successive recording sequences can be inserted adjacent to one another. Successive recording sequences can be inserted within proximity to one another. Successive sequences can be inserted at a distance from one another.

For example, successive recorder sequences can be inserted and separated by 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, or greater than 100 bp. In some examples, successive recorder sequences are separated by about 10, 50, 100, 150, 200, 250, 300, 350, 400, 450, 500, 550, 600, 650, 700, 750, 800, 850, 900, 950, 1000, 1100, 1200, 1300, 1400, 1500, or greater than 1500 bp.

Successive recorder sequences can be separated by any desired number of base pairs, and the separation can depend on and be limited by the number of successive recorder sequences to be inserted, the size of the target nucleic acid or target genomes, and/or the design of the desired final recorder sequence. For example, if the compiled recorder sequence is a functional screenable or selectable marker, then the successive recording sequences can be inserted within proximity to, and within the same reading frame as, one another. If the compiled recorder sequence is a unique set of barcodes to be identified by sequencing and has no coding sequence element, then the successive recorder sequences can be inserted with any desired number of base pairs separating them. In these cases, the separation distance can be dependent on the sequencing technology to be used and the read length limit.
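The read-length constraint mentioned above can be made concrete with a small Python sketch. It assumes, purely for illustration, that all recorder sequences have the same length, that adjacent recorder sequences are separated by a uniform spacer, and that the whole compiled array must fit inside a single sequencing read; the function names and the numbers used are hypothetical.

```python
def recorder_array_span(n_recorders: int, recorder_len: int, spacer_len: int) -> int:
    """Total length in bp of n recorder sequences separated by uniform spacers."""
    if n_recorders < 1 or recorder_len < 1 or spacer_len < 0:
        raise ValueError("invalid recorder array dimensions")
    return n_recorders * recorder_len + (n_recorders - 1) * spacer_len


def fits_in_read(n_recorders: int, recorder_len: int, spacer_len: int, read_len: int) -> bool:
    """True if the compiled recorder array can be covered by one read of read_len bp."""
    return recorder_array_span(n_recorders, recorder_len, spacer_len) <= read_len


# Three 15 bp barcodes with 10 bp spacers span 3 * 15 + 2 * 10 = 65 bp.
span = recorder_array_span(3, 15, 10)
```

Under these assumptions, increasing the number of rounds or the spacer length eventually exceeds a given read length, at which point a longer-read technology or tighter spacing would be needed.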

While preferred embodiments of the present invention have been shown and described herein, it will be obvious to those skilled in the art that such embodiments are provided by way of example only. Numerous variations, changes, and substitutions will now occur to those skilled in the art without departing from the invention. It should be understood that various alternatives to the embodiments of the invention described herein may be employed in practicing the invention. It is intended that the following claims define the scope of the invention and that methods and structures within the scope of these claims and their equivalents be covered thereby.

Some Definitions

As used herein the term “wild type” is a term of the art understood by skilled persons and means the typical form of an organism, strain, gene or characteristic as it occurs in nature as distinguished from mutant or variant forms.

As used herein the term “variant” should be taken to mean the exhibition of qualities that have a pattern that deviates from what occurs in nature.

The terms “orthologue” (also referred to as “ortholog” herein) and “homologue” (also referred to as “homolog” herein) are well known in the art. By means of further guidance, a “homologue” of a protein as used herein is a protein of the same species which performs the same or a similar function as the protein it is a homologue of. Homologous proteins may but need not be structurally related, or are only partially structurally related. An “orthologue” of a protein as used herein is a protein of a different species which performs the same or a similar function as the protein it is an orthologue of. Orthologous proteins may but need not be structurally related, or are only partially structurally related. Homologs and orthologs may be identified by homology modelling (see, e.g., Greer, Science vol. 228 (1985) 1055, and Blundell et al. Eur J Biochem vol 172 (1988), 513) or “structural BLAST” (Dey F, Cliff Zhang Q, Petrey D, Honig B. Toward a “structural BLAST”: using structural relationships to infer function. Protein Sci. 2013 April; 22(4):359-66. doi: 10.1002/pro.2225.).

The terms “polynucleotide”, “nucleotide”, “nucleotide sequence”, “nucleic acid” and “oligonucleotide” are used interchangeably. They refer to a polymeric form of nucleotides of any length, either deoxyribonucleotides or ribonucleotides, or analogs thereof. Polynucleotides may have any three dimensional structure, and may perform any function, known or unknown. The following are non-limiting examples of polynucleotides: coding or non-coding regions of a gene or gene fragment, loci (locus) defined from linkage analysis, exons, introns, messenger RNA (mRNA), transfer RNA, ribosomal RNA, short interfering RNA (siRNA), short-hairpin RNA (shRNA), micro-RNA (miRNA), ribozymes, cDNA, recombinant polynucleotides, branched polynucleotides, plasmids, vectors, isolated DNA of any sequence, isolated RNA of any sequence, nucleic acid probes, and primers. The term also encompasses nucleic-acid-like structures with synthetic backbones, see, e.g., Eckstein, 1991; Baserga et al., 1992; Milligan, 1993; WO 97/03211; WO 96/39154; Mata, 1997; Strauss-Soukup, 1997; and Samstag, 1996. A polynucleotide may comprise one or more modified nucleotides, such as methylated nucleotides and nucleotide analogs. If present, modifications to the nucleotide structure may be imparted before or after assembly of the polymer. The sequence of nucleotides may be interrupted by non-nucleotide components. A polynucleotide may be further modified after polymerization, such as by conjugation with a labeling component.

“Complementarity” refers to the ability of a nucleic acid to form hydrogen bond(s) with another nucleic acid sequence by either traditional Watson-Crick base pairing or other non-traditional types. A percent complementarity indicates the percentage of residues in a nucleic acid molecule which can form hydrogen bonds (e.g., Watson-Crick base pairing) with a second nucleic acid sequence (e.g., 5, 6, 7, 8, 9, 10 out of 10 being 50%, 60%, 70%, 80%, 90%, and 100% complementary). “Perfectly complementary” means that all the contiguous residues of a nucleic acid sequence will hydrogen bond with the same number of contiguous residues in a second nucleic acid sequence. “Substantially complementary” as used herein refers to a degree of complementarity that is at least 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 97%, 98%, 99%, or 100% over a region of 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 30, 35, 40, 45, 50, or more nucleotides, or refers to two nucleic acids that hybridize under stringent conditions.
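As a non-limiting illustration of the percent complementarity calculation, the Python sketch below counts Watson-Crick pairs between two equal-length DNA sequences, both written 5' to 3', that are assumed to anneal in antiparallel orientation without gaps. It considers only the four standard DNA bases and ignores non-traditional pairing; the function name is an assumption made for this example.

```python
# Watson-Crick partners for the four standard DNA bases.
_WC_PARTNER = {"A": "T", "T": "A", "G": "C", "C": "G"}


def percent_complementarity(seq: str, target: str) -> float:
    """Percentage of positions in seq that pair with target (antiparallel, ungapped).

    Both sequences are given 5'->3'; the target is reversed so that position i
    of seq is compared with the base it would face in a duplex.
    """
    if not seq or len(seq) != len(target):
        raise ValueError("sequences must be non-empty and of equal length")
    facing = target.upper()[::-1]
    paired = sum(1 for a, b in zip(seq.upper(), facing) if _WC_PARTNER.get(a) == b)
    return 100.0 * paired / len(seq)


perfect = percent_complementarity("ACGTACGTAC", "GTACGTACGT")       # reverse complement
one_mismatch = percent_complementarity("ACGTACGTAC", "GTACGTACGA")  # 9 of 10 positions pair
```

The second call corresponds to the "9 out of 10 being 90%" case in the definition above.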

As used herein, “stringent conditions” for hybridization refer to conditions under which a nucleic acid having complementarity to a target sequence predominantly hybridizes with the target sequence, and substantially does not hybridize to non-target sequences. Stringent conditions are generally sequence-dependent, and vary depending on a number of factors. In general, the longer the sequence, the higher the temperature at which the sequence specifically hybridizes to its target sequence. Non-limiting examples of stringent conditions are described in detail in Tijssen (1993), Laboratory Techniques In Biochemistry And Molecular Biology-Hybridization With Nucleic Acid Probes Part I, Second Chapter “Overview of principles of hybridization and the strategy of nucleic acid probe assay”, Elsevier, N.Y. Where reference is made to a polynucleotide sequence, then complementary or partially complementary sequences are also envisaged. These are preferably capable of hybridizing to the reference sequence under highly stringent conditions. Generally, in order to maximize the hybridization rate, relatively low-stringency hybridization conditions are selected: about 20 to 25 degrees Celsius lower than the thermal melting point (Tm). The Tm is the temperature at which 50% of specific target sequence hybridizes to a perfectly complementary probe in solution at a defined ionic strength and pH. Generally, in order to require at least about 85% nucleotide complementarity of hybridized sequences, highly stringent washing conditions are selected to be about 5 to 15 degrees Celsius lower than the Tm. In order to require at least about 70% nucleotide complementarity of hybridized sequences, moderately-stringent washing conditions are selected to be about 15 to 30 degrees Celsius lower than the Tm. Highly permissive (very low stringency) washing conditions may be as low as 50 degrees Celsius below the Tm, allowing a high level of mis-matching between hybridized sequences. Those skilled in the art will recognize that other physical and chemical parameters in the hybridization and wash stages can also be altered to affect the outcome of a detectable hybridization signal from a specific level of homology between target and probe sequences.
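The temperature windows stated above can be expressed as offsets below the Tm. The Python sketch below simply encodes those offsets; it does not estimate the Tm itself, and the dictionary keys, the function name, and the 70 degree Celsius Tm used in the example are hypothetical.

```python
from typing import Tuple

# Offsets below the Tm, in degrees Celsius, as (smallest, largest) offset.
OFFSETS_BELOW_TM_C = {
    "hybridization": (20.0, 25.0),    # low-stringency hybridization
    "high_wash": (5.0, 15.0),         # highly stringent wash (about 85% complementarity)
    "moderate_wash": (15.0, 30.0),    # moderately stringent wash (about 70% complementarity)
}


def temperature_window(tm_c: float, condition: str) -> Tuple[float, float]:
    """Return (lowest, highest) temperature in degrees Celsius for a condition."""
    if condition not in OFFSETS_BELOW_TM_C:
        raise KeyError(f"unknown condition {condition!r}")
    smallest, largest = OFFSETS_BELOW_TM_C[condition]
    return (tm_c - largest, tm_c - smallest)


# For a probe with a hypothetical Tm of 70 degrees Celsius:
high = temperature_window(70.0, "high_wash")
moderate = temperature_window(70.0, "moderate_wash")
```

The very low stringency case (as low as 50 degrees Celsius below the Tm) is a single lower bound rather than a window and is therefore left out of the table.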

“Hybridization” refers to a reaction in which one or more polynucleotides react to form a complex that is stabilized via hydrogen bonding between the bases of the nucleotide residues. The hydrogen bonding may occur by Watson-Crick base pairing, Hoogsteen binding, or in any other sequence specific manner. The complex may comprise two strands forming a duplex structure, three or more strands forming a multi-stranded complex, a single self-hybridizing strand, or any combination of these. A hybridization reaction may constitute a step in a more extensive process, such as the initiation of PCR, or the cleavage of a polynucleotide by an enzyme. A sequence capable of hybridizing with a given sequence is referred to as the “complement” of the given sequence.

As used herein, the term “genomic locus” or “locus” (plural loci) is the specific location of a gene or DNA sequence on a chromosome. A “gene” refers to stretches of DNA or RNA that encode a polypeptide or an RNA chain that has a functional role to play in an organism and hence is the molecular unit of heredity in living organisms. For the purpose of this invention it may be considered that genes include regions which regulate the production of the gene product, whether or not such regulatory sequences are adjacent to coding and/or transcribed sequences. Accordingly, a gene includes, but is not necessarily limited to, promoter sequences, terminators, translational regulatory sequences such as ribosome binding sites and internal ribosome entry sites, enhancers, silencers, insulators, boundary elements, replication origins, matrix attachment sites and locus control regions.

As used herein, “expression of a genomic locus” or “gene expression” is the process by which information from a gene is used in the synthesis of a functional gene product. The products of gene expression are often proteins, but in non-protein coding genes such as rRNA genes or tRNA genes, the product is functional RNA. The process of gene expression is used by all known life, namely eukaryotes (including multicellular organisms), prokaryotes (bacteria and archaea), and viruses, to generate functional products to survive. As used herein “expression” of a gene or nucleic acid encompasses not only cellular gene expression, but also the transcription and translation of nucleic acid(s) in cloning systems and in any other context. As used herein, “expression” also refers to the process by which a polynucleotide is transcribed from a DNA template (such as into an mRNA or other RNA transcript) and/or the process by which a transcribed mRNA is subsequently translated into peptides, polypeptides, or proteins. Transcripts and encoded polypeptides may be collectively referred to as “gene product.” If the polynucleotide is derived from genomic DNA, expression may include splicing of the mRNA in a eukaryotic cell.

The terms “polypeptide”, “peptide” and “protein” are used interchangeably herein to refer to polymers of amino acids of any length. The polymer may be linear or branched, it may comprise modified amino acids, and it may be interrupted by non amino acids. The terms also encompass an amino acid polymer that has been modified; for example, disulfide bond formation, glycosylation, lipidation, acetylation, phosphorylation, or any other manipulation, such as conjugation with a labeling component. As used herein the term “amino acid” includes natural and/or unnatural or synthetic amino acids, including glycine and both the D and L optical isomers, and amino acid analogs and peptidomimetics.

As used herein, the term “domain” or “protein domain” refers to a part of a protein sequence that may exist and function independently of the rest of the protein chain.

As described in aspects of the invention, sequence identity is related to sequence homology. Homology comparisons may be conducted by eye, or more usually, with the aid of readily available sequence comparison programs. These commercially available computer programs may calculate percent (%) homology between two or more sequences and may also calculate the sequence identity shared by two or more amino acid or nucleic acid sequences. Sequence homologies may be generated by any of a number of computer programs known in the art, for example BLAST or FASTA, etc. A suitable computer program for carrying out such an alignment is the GCG Wisconsin Bestfit package (University of Wisconsin, U.S.A.; Devereux et al., 1984, Nucleic Acids Research 12:387). Examples of other software that may perform sequence comparisons include, but are not limited to, the BLAST package (see Ausubel et al., 1999 ibid, Chapter 18), FASTA (Altschul et al., 1990, J. Mol. Biol., 403-410) and the GENEWORKS suite of comparison tools. Both BLAST and FASTA are available for offline and online searching (see Ausubel et al., 1999 ibid, pages 7-58 to 7-60). However, it is preferred to use the GCG Bestfit program.

Percent homology may be calculated over contiguous sequences, i.e., one sequence is aligned with the other sequence and each amino acid or nucleotide in one sequence is directly compared with the corresponding amino acid or nucleotide in the other sequence, one residue at a time. This is called an “ungapped” alignment. Typically, such ungapped alignments are performed only over a relatively short number of residues.

Although this is a very simple and consistent method, it fails to take into consideration that, for example, in an otherwise identical pair of sequences, one insertion or deletion may cause the following amino acid residues to be put out of alignment, thus potentially resulting in a large reduction in % homology when a global alignment is performed. Consequently, most sequence comparison methods are designed to produce optimal alignments that take into consideration possible insertions and deletions without unduly penalizing the overall homology or identity score. This is achieved by inserting “gaps” in the sequence alignment to try to maximize local homology or identity.
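The weakness of an ungapped comparison can be seen in a minimal Python sketch. The function below compares two equal-length sequences position by position; the function name and the example peptide strings are arbitrary placeholders chosen for illustration and are not sequences from the disclosure.

```python
def ungapped_percent_identity(seq_a: str, seq_b: str) -> float:
    """Percent identity from a residue-by-residue comparison with no gaps."""
    if not seq_a or len(seq_a) != len(seq_b):
        raise ValueError("sequences must be non-empty and of equal length")
    identical = sum(1 for a, b in zip(seq_a, seq_b) if a == b)
    return 100.0 * identical / len(seq_a)


reference = "MKTAYIAKQR"
# Same sequence with the first residue deleted and one residue appended,
# so that every remaining position is shifted by one.
shifted = "KTAYIAKQRQ"

same = ungapped_percent_identity(reference, reference)
after_deletion = ungapped_percent_identity(reference, shifted)
```

Although nine of the ten residues in `shifted` are shared with `reference` in the same order, the single deletion moves every residue out of register, so no position matches in the ungapped comparison. A gapped alignment is intended to recover that shared stretch.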

However, these more complex methods assign “gap penalties” to each gap that occurs in the alignment so that, for the same number of identical amino acids, a sequence alignment with as few gaps as possible, reflecting higher relatedness between the two compared sequences, may achieve a higher score than one with many gaps. “Affine gap costs” are typically used that charge a relatively high cost for the existence of a gap and a smaller penalty for each subsequent residue in the gap. This is the most commonly used gap scoring system. High gap penalties may, of course, produce optimized alignments with fewer gaps. Most alignment programs allow the gap penalties to be modified. However, it is preferred to use the default values when using such software for sequence comparisons. For example, when using the GCG Wisconsin Bestfit package the default gap penalty for amino acid sequences is −12 for a gap and −4 for each extension.
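A minimal Python sketch of this gap scoring scheme follows, using the −12 and −4 values quoted above as defaults. It assumes that the opening penalty covers the first gapped position and that the extension penalty applies to each subsequent position in the same gap; alignment programs differ on this convention, so the exact totals are illustrative. The function names are hypothetical.

```python
from itertools import groupby


def affine_gap_penalty(gap_length: int, gap_open: int = -12, gap_extend: int = -4) -> int:
    """Penalty for one gap: opening cost plus an extension cost per further position."""
    if gap_length < 0:
        raise ValueError("gap_length must be non-negative")
    if gap_length == 0:
        return 0
    return gap_open + gap_extend * (gap_length - 1)


def total_gap_penalty(aligned: str, gap_char: str = "-") -> int:
    """Sum the affine penalties over every run of gap characters in an aligned sequence."""
    return sum(
        affine_gap_penalty(len(list(run)))
        for is_gap, run in groupby(aligned, key=lambda c: c == gap_char)
        if is_gap
    )


one_long_gap = total_gap_penalty("AC----GT")     # a single gap of four positions
four_short_gaps = total_gap_penalty("A-C-G-T-")  # four separate gaps of one position
```

Under these assumptions the single four-position gap costs −24 while four separate one-position gaps cost −48, which is why, for the same number of gapped positions, an alignment with fewer gaps scores higher.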

Calculation of maximum % homology therefore first requires the production of an optimal alignment, taking into consideration gap penalties. A suitable computer program for carrying out such an alignment is the GCG Wisconsin Bestfit package (Devereux et al., 1984 Nuc. Acids Research 12 p387). Examples of other software that may perform sequence comparisons include, but are not limited to, the BLAST package (see Ausubel et al., 1999 Short Protocols in Molecular Biology, 4th Ed., Chapter 18), FASTA (Altschul et al., 1990 J. Mol. Biol. 403-410) and the GENEWORKS suite of comparison tools. Both BLAST and FASTA are available for offline and online searching (see Ausubel et al., 1999, Short Protocols in Molecular Biology, pages 7-58 to 7-60). However, for some applications, it is preferred to use the GCG Bestfit program. A new tool, called BLAST 2 Sequences, is also available for comparing protein and nucleotide sequences (see FEMS Microbiol Lett. 1999 174(2): 247-50; FEMS Microbiol Lett. 1999 177(1): 187-8 and the website of the National Center for Biotechnology Information at the website of the National Institutes of Health).

Although the final % homology may be measured in terms of identity, the alignment process itself is typically not based on an all-or-nothing pair comparison. Instead, a scaled similarity score matrix is generally used that assigns scores to each pair-wise comparison based on chemical similarity or evolutionary distance. An example of such a matrix commonly used is the BLOSUM62 matrix, the default matrix for the BLAST suite of programs. GCG Wisconsin programs generally use either the public default values or a custom symbol comparison table, if supplied (see user manual for further details). For some applications, it is preferred to use the public default values for the GCG package, or in the case of other software, the default matrix, such as BLOSUM62.

Alternatively, percentage homologies may be calculated using the multiple alignment feature in DNASIS™ (Hitachi Software), based on an algorithm analogous to CLUSTAL (Higgins D G & Sharp P M (1988), Gene 73(1), 237-244). Once the software has produced an optimal alignment, it is possible to calculate % homology, preferably % sequence identity. The software typically does this as part of the sequence comparison and generates a numerical result.

Sequences may also have deletions, insertions or substitutions of amino acid residues which produce a silent change and result in a functionally equivalent substance. Deliberate amino acid substitutions may be made on the basis of similarity in amino acid properties (such as polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic nature of the residues) and it is therefore useful to group amino acids together in functional groups. Amino acids may be grouped together based on the properties of their side chains alone. However, it is more useful to include mutation data as well. The sets of amino acids thus derived are likely to be conserved for structural reasons. These sets may be described in the form of a Venn diagram (Livingstone C. D. and Barton G. J. (1993) “Protein sequence alignments: a strategy for the hierarchical analysis of residue conservation” Comput. Appl. Biosci. 9: 745-756) (Taylor W. R. (1986) “The classification of amino acid conservation” J. Theor. Biol. 119; 205-218). Conservative substitutions may be made, for example according to the table below which describes a generally accepted Venn diagram grouping of amino acids.

Embodiments of the invention include sequences (both polynucleotide and polypeptide) which may comprise homologous substitution (substitution and replacement are both used herein to mean the interchange of an existing amino acid residue or nucleotide with an alternative residue or nucleotide), i.e., like-for-like substitution, in the case of amino acids such as basic for basic, acidic for acidic, polar for polar, etc. Non-homologous substitution may also occur, i.e., from one class of residue to another, or alternatively involving the inclusion of unnatural amino acids such as ornithine (hereinafter referred to as Z), diaminobutyric acid ornithine (hereinafter referred to as B), norleucine ornithine (hereinafter referred to as O), pyridylalanine, thienylalanine, naphthylalanine and phenylglycine.

Variant amino acid sequences may include suitable spacer groups that may be inserted between any two amino acid residues of the sequence, including alkyl groups such as methyl, ethyl or propyl groups in addition to amino acid spacers such as glycine or β-alanine residues. A further form of variation, which involves the presence of one or more amino acid residues in peptoid form, may be well understood by those skilled in the art. For the avoidance of doubt, “the peptoid form” is used to refer to variant amino acid residues wherein the α-carbon substituent group is on the residue's nitrogen atom rather than the α-carbon. Processes for preparing peptides in the peptoid form are known in the art, for example Simon R J et al., PNAS (1992) 89(20), 9367-9371 and Horwell D C, Trends Biotechnol. (1995) 13(4), 132-134.

The practice of the present invention employs, unless otherwise indicated, conventional techniques of immunology, biochemistry, chemistry, molecular biology, microbiology, cell biology, genomics and recombinant DNA, which are within the skill of the art. See Green and Sambrook (Molecular Cloning: A Laboratory Manual, 4th ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 2014); CURRENT PROTOCOLS IN MOLECULAR BIOLOGY (F. M. Ausubel, et al. eds. (2017)); Short Protocols in Molecular Biology (Ausubel et al., 1999); the series METHODS IN ENZYMOLOGY (Academic Press, Inc.); PCR 2: A PRACTICAL APPROACH (M. J. MacPherson, B. D. Hames and G. R. Taylor eds. (1995)); ANTIBODIES, A LABORATORY MANUAL, SECOND EDITION (Harlow and Lane, eds. (2014)); and CULTURE OF ANIMAL CELLS: A MANUAL OF BASIC TECHNIQUE, 7TH EDITION (R. I. Freshney, ed. (2016)).

EXAMPLES

The following examples are given for the purpose of illustrating various embodiments of the invention and are not meant to limit the present invention in any fashion. The present examples, along with the methods described herein, are presently representative of preferred embodiments, are exemplary, and are not intended as limitations on the scope of the invention. Changes therein and other uses which are encompassed within the spirit of the invention as defined by the scope of the claims will occur to those skilled in the art.

Example 1. Nucleic Acid-Guided Nucleases

Sequences for twenty nucleic acid-guided nucleases, termed MAD1-MAD20 (SEQ ID NOs 1-20), were aligned and compared to other nucleic acid-guided nucleases. A partial alignment and phylogenetic tree are depicted in FIG. 1A and FIG. 1B respectively. Key residues that may be involved in the recognition of a PAM site are shown in FIG. 1A. These include amino acids at positions 167, 539, 548, 599, 603, 604, 605, 606, and 607.

Sequence alignments were built using PSI-BLAST to search for MAD nuclease homologs in the NCBI non-redundant databases. Multiple sequence alignments were further refined using the MUSCLE alignment algorithm with default settings as implemented in Geneious 10. The percent identity of each homolog to the SpCas9 and AsCpf1 reference sequences was computed based on the pairwise alignment matching from these global alignments.

Genomic source sequences were identified using Uniprot linkage information or TBLASTN searches of NCBI using the default parameters and searching all possible frames for translational matches.

Percent identities of MAD1-8 and 10-12 to various other nucleases are summarized in Table 1. These percent identities represent the shared amino acid sequence identity between the indicated proteins.

TABLE 1 Protein identifier or accession number MAD1 MAD2 MAD3 MAD4 MAD5MAD6 MAD7 MAD8 MAD10 MAD11 MAD12 gi|1025734861|pdb|5B43|A 6.4 32.8 33.229.7 29.4 31.1 30.3 31.7 26.7 27.9 98.8 gi|1052245173|pdb|5KK5|A 6.432.7 33.1 29.7 29.3 31 30.3 31.7 26.7 27.8 98.7gi|1086216683|emb|SDC16215.1| 6.1 33 34.4 29.6 30.1 33.5 32.3 32.1 26.227.2 46.8 gi|1120175333|ref|WP_073043853.1| 5.9 30.9 37.2 32.8 33.6 34.435.7 35.1 26.3 28.3 34.9 Cpf1.Sj|WP_081839471 6.6 33.6 41.7 37.2 33.437.6 40.1 37.7 29.1 30.3 34.1 Cpf1.Ss|KFO67989 6.9 32.3 35.7 43 33.745.9 34.8 48 33.2 33.4 33.8 MAD3 5.8 31 100 32.9 35.9 35 35.6 34.3 2827.6 33.1 gi|1082474576|gb|OFY19591.1| 7 31.4 35.9 43.2 31.4 45 33.648.6 30.8 33.5 33 MAD2 6.1 100 31 30.7 30.2 31 31.2 31.2 25.8 27.7 32.6Cpf1.Lb5|WP_016301126 7.8 32.8 36.5 38.2 34.2 45.5 35.8 43.6 30.7 35.732.5 gi|1088286736|gb|OHB41002.1| 6.7 30.6 35.3 42.4 33.2 44.7 32.1 46.830.7 32.6 32.4 gi|1094423310|emb|SER03894.1| 6.8 30.8 36.1 40.4 31.850.4 35.2 46.6 30.4 36.8 32.3 gi|493326531|ref|WP_006283774.1| 6.8 30.836.1 40.3 31.8 50.3 35.1 46.6 30.4 36.8 32.3 MAD8 7.6 31.2 34.3 40.4 3241.6 32.8 100 30.1 32.1 31.7 Cpf1.Bot|WP_009217842 6.9 30.1 36.6 41.532.5 50.2 35.4 45.5 29.8 34.1 31.6 Cpf1.Li|WP_020988726 7.3 30.2 34.639.3 30.3 40.7 31.8 39.4 32.1 31.3 31.5 Cpf1.Pb|WP_044110123 6.3 31.431.8 36.1 30.8 45.7 30.4 39.4 27.7 33.5 31.5 gi|817911372|gb|AKG08867.1|7.3 29.8 35 40.7 32.1 40.3 32.6 41.7 29.1 31 31.4gi|1052838533|emb|SCH45297.1| 6.6 30.8 35.5 32 31.5 34.4 51.9 33.4 26.129 31.3 gi|1053713332|ref|WP_066040075.1| 7.2 29.6 33.2 39.6 29.8 49.132.2 41.4 30.1 32.4 31.3 gi|817909002|gb|AKG06878.1| 7.3 29.8 35 40.7 3240.3 32.5 41.6 29.1 30.9 31.3 gi|1042201477|ref|WP_065256572.1| 7.2 29.535.2 40.6 31.9 40.1 32.7 41.6 29 30.8 31.2 MAD6 7.5 31 35 38.9 33.1 10034.3 41.6 30.5 33.6 31 gi|490468773|ref|WP_004339290.1| 6.8 31.8 31.736.2 28.6 36.5 31.4 38.4 28.5 31.4 31 gi|565853704|ref|WP_023936172.1|7.5 30.8 34.9 38.9 33.1 99.7 34.1 41.6 30.4 33.6 
31gi|739005707|ref|WP_036887416.1| 7.5 30.9 35 38.9 33 99.9 34.2 41.5 30.433.5 31 gi|739008549|ref|WP_036890108.1| 7.5 31 35 38.8 33 99.8 34.241.5 30.4 33.5 31 Cpf1.Ft|WP_014550095 7.1 31.9 33.8 40.3 29.7 39.4 34.141 29.8 32.5 30.8 gi|504362993|ref|WP_014550095.1| 7.2 32.4 33.8 40.329.6 39.4 33.8 40.9 30.1 32.5 30.8 gi|640557447|ref|WP_024988992.1| 6.631.4 34.8 40.7 31.2 48 34.1 45.1 28.8 35.2 30.8gi|1098944113|ref|WP_071304624.1| 7.1 32.3 33.5 40.3 29.6 39.2 33.8 40.930.1 32.5 30.6 gi|489124848|ref|WP_003034647.1| 7.1 32.3 33.9 40.9 29.939.2 33.9 40.9 29.9 32.2 30.6 gi|738967776|ref|WP_036851563.1| 6.8 29.433.1 35.5 28.9 40.3 30.7 35.9 28.7 31.3 30.5 MAD7 5.9 31.2 35.6 30.833.9 34.3 100 32.8 24.2 28.9 30.5 Cpf1.Lb6|WP_044910713 6.7 29.8 33.736.6 30.9 43 34 39.8 29.1 32.1 30.4 gi|1052961977|emb|SCH47915.1| 5.530.5 35.8 32.3 34 35 53.8 33.4 26.2 27.4 30.4gi|817918353|gb|AKG14689.1| 7 29.1 34.4 39.8 31.7 40 32.4 41.1 28.4 30.130.3 gi|917059416|ref|WP_051666128.1| 6.9 29.9 31.5 35.7 31.6 41.8 32.939.1 30.1 34 30.2 gi|1011649201|ref|WP_062499108.1| 6.8 29 34.7 40.331.4 40.1 33.1 41.6 28.5 30.4 30.1 Cpf1.PmWP_018359861 6.3 29.2 32.334.2 27.4 38.7 29.4 35 27.2 30.1 30 gi|817922537|gb|AKG18099.1| 6.8 29.134.5 39.6 31.5 39.9 32.7 40.7 28.3 29.8 30gi|769142322|ref|WP_044919442.1| 6.7 31 34.6 37.8 31.5 41.4 33.3 39.2 2831.9 29.9 gi|1023176441|pdb|5ID6|A 6.7 29.7 31.3 35.5 31.3 41 32.6 38.529.7 33.3 29.8 gi|491540987|ref|WP_005398606.1| 5.9 28.3 30.4 29.7 28.529 30.7 29.8 25.8 27.8 29.8 gi|652820612|ref|WP_027109509.1| 6.4 31.1 3435.3 31.7 40.3 33.4 37.5 28.5 33.3 29.8 gi|502240446|ref|WP_012739647.1|5.9 31.6 36.1 31.2 33 35.4 49.4 34 26.6 29.4 29.7gi|524278046|emb|CDA41776.1| 5.8 31.6 36 31 33 35.4 50 34 26.6 29.5 29.7gi|737831580|ref|WP_035798880.1| 6.2 31.3 34.8 38.1 31.5 42.1 33 39.628.4 32.4 29.7 gi|909652572ref|WP_049895985.1| 6.9 30.7 34.2 37.2 30.841.5 34.2 38.7 28 32 29.7 MAD4 6.7 30.7 32.9 100 30.7 38.9 30.8 40.428.8 29.4 29.7 gi|942073 049|ref|WP_055286279.1| 5.9 
31.6 36.1 31.1 32.735 49.7 33.9 27.1 29.5 29.6 gi|654794505|ref|WP_028248456.1| 7.4 30.535.9 37.4 31.3 42.8 34.2 40.2 27.9 33.5 29.5gi|933014786|emb|CUO47728.1| 5.6 31.3 34.9 31.2 31.5 32.4 46.7 30.6 25.427.7 29.4 gi|941887450|ref|WP_055224182.1| 5.6 31.4 35 31.3 31.6 32.546.6 30.7 25.3 27.8 29.4 gi|920071674|ref|WP_052943011.1| 6.3 31 31.838.8 31.8 41.3 33.8 42.6 29.8 34.7 29 MAD5 5.1 30.2 35.9 30.7 100 33.133.9 32 24.3 28.7 29 gi|1081462674|emb|SCZ76797.1| 6.9 30.4 33.5 34.729.7 40.1 30.5 37.4 27.3 32.5 28.9 gi|918722523|ref|WP_052585281.1| 7.427.5 30.5 35.7 28.3 35.2 28.5 36 26 27.1 28.8gi|524816323|emb|CDF09621.1| 6.2 30 34.1 29.3 31.2 32.7 47.6 32.2 25.525.9 28.4 gi|941782328|ref|WP_055176369.1| 6.2 30.2 33.1 28.9 30.9 3246.9 32.1 26 27.1 28.4 gi|942113296|ref|WP_055306762.1| 6.4 29.8 33.829.7 31.3 33.1 48 32.5 25.8 26.2 28.4 MAD11 6.4 27.7 27.6 29.4 28.7 33.628.9 32.1 26.2 100 27.8 gi|653158548ref|WP_027407524.1| 5.9 26.4 28.133.5 27.4 32.5 27.8 32 27 26.8 27.6 gi|652963004|ref|WP_027216152.1| 6.630.3 32.5 33.2 30.4 38.2 29.6 34.6 25.9 30.5 27.2gi|1083069650|gb|OGD68774. 
1| 6.2 25 24.3 26.6 23.1 28.1 23.2 26.4 4524.9 27.1 gi|302483275|gb|EFL46285.1| 5.6 24.7 26.8 30.3 24.9 34.8 2630.4 24.4 27.5 27.1 gi|915400855|ref|WP_050786240.1| 5.6 24.7 26.8 30.324.9 34.8 26 30.4 24.4 27.5 27.1 MAD10 5.6 25.8 28 28.8 24.3 30.5 24.230.1 100 26.2 26.6 gi|1101117967|gb|OIO75780.1| 6.1 26.8 26 27.3 24.328.1 24.4 28.2 44.1 25.4 26.1 gi|1088204458|gb|OHA63117.1| 6.5 25.2 23.525.8 22.9 27 22 26.1 36.5 24.2 24.7 gi|809198071|ref|WP_046328599.1| 4.925.6 26.5 22.2 23.9 23.8 25.8 23.9 20.3 25.1 24gi|1088079929|gb|OGZ45678.1| 5.6 21.9 23.8 26.9 23.4 27.8 23.3 26.7 28.824.7 23.5 gi|1101053499|gb|OIO15737.1| 5.9 23.1 26.2 25.2 23 26.4 25.126.5 29.2 23.2 23.4 gi|1101058058|gb|OIO19978.1| 5.4 21.2 22.8 23.6 20.625 20.7 25 25.9 22.2 23 gi|1088000848|gb|OGY73485.1| 5.7 23.5 25.2 25.523.9 27 25.1 25.6 31.6 23.6 22.9 gi|407014433|gb|EKE28449.1| 5.2 23.525.9 26.7 24.3 25.8 23 27.8 29.9 25.3 22.9 gi|818249855|gb|KKP36646.1| 621 20.7 23.5 20 24.2 21 24 24.6 21.8 22.6 gi|818703647|gb|KKT48220.1|5.8 23.3 25 25.1 23.5 26.5 24.7 25.3 31.2 23.3 22.6gi|818705786|gb|KKT50231.1| 5.8 23.1 24.6 24.7 22.9 26.2 24.2 24.8 30.822.9 22.2 gi|1083950632|gb|OGJ66851.1| 4.5 20 22.1 23.5 20.6 24.6 20 2423.5 20.7 22.1 gi|1083932199|gb|OGJ49885.1| 6 20.4 20.2 22.6 19.3 23.320.6 23.2 23.9 21 21.8 gi|1083410735|gb|OGF20863.1| 5 21.7 23.3 25.5 2325 22.7 25.9 27.2 22.4 21.5 gi|1011480927|ref|WP_062376669.1| 4.7 20.120.1 21.4 19.3 23.3 21.4 22 20.2 19.7 20.9 gi|818539593|gb|KKR91555.1|5.1 19.8 21.6 22.1 20.5 22.9 21.2 22.8 24 20.5 19.9gi|503048015|ref|WP_013282991.1| 5.1 18.8 20.7 15.3 19.7 18.9 19.3 17.715.9 19 19.2 gi|1096232746|ref|WP_071177645.1| 5 19.1 20.5 17.4 20.119.7 20.4 20.4 17.5 18.5 18.9 gi|769130404|ref|WP_044910712.1| 4.6 19.418.2 16.1 18.1 17.1 18.7 17.9 14.5 16.8 17.5gi|1085569500|gb|OGX23684.1| 2.6 11.6 12.1 12.7 10.2 12.1 12.7 11.6 10.911.1 10.5 gi|818357062|gb|KKQ38176.1| 3.3 10 11.1 10.6 11.1 11.8 12.111.5 12.2 10.8 9.8 gi|745626763|gb|KIE18642.1| 3.7 9.4 11.7 11.1 
11.112.5 11.9 11.9 10.2 10.6 8.8 MAD1 100 6.1 5.8 6.7 5.1 7.5 5.9 7.6 5.66.4 6.4 SpCas9 4 6.3 6.5 8.3 5.6 8.1 6.9 7.7 6.9 6.3 6.3 MAD12 6.4 32.633.1 29.7 29 31 30.5 31.7 26.6 27.8 100

Example 2. Expression of MAD Nucleases

Wild-type nucleic acid sequences for MAD1-MAD20 include SEQ ID NOs 21-40, respectively. These MAD nucleases were codon optimized for expression in E. coli and the codon optimized sequences are listed as SEQ ID NO: 41-60, respectively (summarized in Table 2).

Codon optimized MAD1-MAD20 were cloned into an expression construct comprising a constitutive or inducible promoter (e.g., proB promoter SEQ ID NO: 83, or pBAD promoter SEQ ID NO: 81 or SEQ ID NO: 82) and an optional 6×-His tag (SEQ ID NO: 182) (e.g., FIG. 2). The generated MAD1-MAD20 expression constructs are provided as SEQ ID NOs: 61-80, respectively. The expression constructs as depicted in FIG. 2 were generated either by restriction/ligation-based cloning or homology-based cloning.

Example 3. Testing Guide Nucleic Acid Sequences Compatible with MAD Nucleases

In order to have a functioning targetable nuclease complex, a nucleic acid-guided nuclease and a compatible guide nucleic acid are needed. To determine the compatible guide nucleic acid sequence, specifically the scaffold sequence portion of the guide nucleic acid, multiple approaches were taken. First, scaffold sequences were searched for near the endogenous loci of each MAD nuclease. In some cases, such as with MAD2, no endogenous scaffold sequence was found. Therefore, we tested the compatibility of MAD2 with scaffold sequences found near the endogenous loci of the other MAD nucleases. The MAD nucleases and corresponding endogenous scaffold sequences that were tested are listed in Table 2.

TABLE 2
MAD nuclease | WT nucleic acid sequence | Codon optimized nucleic acid sequence | Amino acid sequence | Endogenous scaffold sequence for guide nucleic acid
MAD1 | SEQ ID NO: 21 | SEQ ID NO: 41 | SEQ ID NO: 1 | SEQ ID NO: 84
MAD2 | SEQ ID NO: 22 | SEQ ID NO: 42 | SEQ ID NO: 2 | None identified
MAD3 | SEQ ID NO: 23 | SEQ ID NO: 43 | SEQ ID NO: 3 | SEQ ID NO: 86
MAD4 | SEQ ID NO: 24 | SEQ ID NO: 44 | SEQ ID NO: 4 | SEQ ID NO: 87
MAD5 | SEQ ID NO: 25 | SEQ ID NO: 45 | SEQ ID NO: 5 | SEQ ID NO: 88
MAD6 | SEQ ID NO: 26 | SEQ ID NO: 46 | SEQ ID NO: 6 | SEQ ID NO: 89
MAD7 | SEQ ID NO: 27 | SEQ ID NO: 47 | SEQ ID NO: 7 | SEQ ID NO: 90
MAD8 | SEQ ID NO: 28 | SEQ ID NO: 48 | SEQ ID NO: 8 | SEQ ID NO: 91
MAD9 | SEQ ID NO: 29 | SEQ ID NO: 49 | SEQ ID NO: 9 | SEQ ID NO: 92; SEQ ID NO: 103; SEQ ID NO: 106
MAD10 | SEQ ID NO: 30 | SEQ ID NO: 50 | SEQ ID NO: 10 | SEQ ID NO: 93
MAD11 | SEQ ID NO: 31 | SEQ ID NO: 51 | SEQ ID NO: 11 | SEQ ID NO: 94
MAD12 | SEQ ID NO: 32 | SEQ ID NO: 52 | SEQ ID NO: 12 | SEQ ID NO: 95
MAD13 | SEQ ID NO: 33 | SEQ ID NO: 53 | SEQ ID NO: 13 | SEQ ID NO: 96; SEQ ID NO: 105; SEQ ID NO: 107
MAD14 | SEQ ID NO: 34 | SEQ ID NO: 54 | SEQ ID NO: 14 | SEQ ID NO: 97
MAD15 | SEQ ID NO: 35 | SEQ ID NO: 55 | SEQ ID NO: 15 | SEQ ID NO: 98
MAD16 | SEQ ID NO: 36 | SEQ ID NO: 56 | SEQ ID NO: 16 | SEQ ID NO: 99
MAD17 | SEQ ID NO: 37 | SEQ ID NO: 57 | SEQ ID NO: 17 | SEQ ID NO: 100
MAD18 | SEQ ID NO: 38 | SEQ ID NO: 58 | SEQ ID NO: 18 | SEQ ID NO: 101
MAD19 | SEQ ID NO: 39 | SEQ ID NO: 59 | SEQ ID NO: 19 | SEQ ID NO: 102
MAD20 | SEQ ID NO: 40 | SEQ ID NO: 60 | SEQ ID NO: 20 | SEQ ID NO: 103

Editing cassettes as depicted in FIG. 3 were generated to assess the functionality of the MAD nucleases and corresponding guide nucleic acids. Each editing cassette comprises an editing sequence and a promoter operably linked to an encoded guide nucleic acid. The editing cassettes further comprise primer sites (P1 and P2) on flanking ends. The guide nucleic acids comprised various scaffold sequences to be tested, as well as a guide sequence to guide the MAD nuclease to the target sequence for editing. The editing sequences comprised a PAM mutation and/or codon mutation relative to the target sequence. The mutations were flanked by regions of homology (homology arms or HA) which would allow recombination into the cleaved target sequence.

FIG. 4 depicts an experimental design to test different MAD nuclease and guide nucleic acid combinations. An expression cassette encoding the MAD nuclease, or the MAD nuclease protein, was added to host cells along with various editing cassettes as described above. In this example, the guide nucleic acids were engineered to target the galK gene in the host cell, and the editing sequence was designed to mutate the targeted galK gene in order to turn the gene off, thereby allowing for screening of successfully edited cells. This design was used for identification of functional or compatible MAD nuclease and guide nucleic acid combinations. Editing efficiency was determined by qPCR to measure the editing plasmid in the recovered cells in a high-throughput manner. Validation of MAD11 and Cas9 primers is shown in FIGS. 14A and 14B. These results show that the selected primer pairs are orthogonal and allow quantitative measurement of input plasmid DNA.

FIGS. 5A-5B depict a similar experimental design. In this case, the editing cassette (FIG. 5B) further comprises a selectable marker, in this case kanamycin resistance (kan), and the MAD nuclease expression vector (FIG. 5A) further comprises a selectable marker, in this case chloramphenicol resistance (Cm), and the lambda RED recombination system to aid homologous recombination (HR) of the editing sequence into the target sequence. A compatible MAD nuclease and guide nucleic acid combination will cause a double strand break in the target sequence if a PAM sequence is present. Since the editing sequence (e.g., FIG. 3) contains a PAM mutation that is not recognized by the MAD nuclease, edited cells that contain the PAM mutation survive cleavage by the MAD nuclease, while wild-type non-edited cells die (FIG. 5C). The editing sequence further comprises a mutation in the galK gene that allows for screening of edited cells, while the MAD nuclease expression vector and editing cassette contain drug selection markers, allowing for selection of edited cells.

Using these methods, compatible guide nucleic acids for MAD1-MAD20 were tested. Twenty scaffold sequences were tested. The guide nucleic acids used in the experiments contained one of the twenty scaffold sequences, referred to as scaffold-1, scaffold-2, etc., and a guide sequence that targets the galK gene. Sequences for Scaffold-1 through Scaffold-20 are listed as SEQ ID NO: 84-103, respectively. It should be understood that the guide sequence of the guide nucleic acid is variable and can be engineered or designed to target any desired target sequence. Since MAD2 does not have an endogenous scaffold sequence to test, a scaffold sequence from a close homolog (scaffold-2, SEQ ID NO: 85) was tested and found to be a non-functional pair, meaning MAD2 and scaffold-2 were not compatible. Therefore, MAD2 was tested with the other nineteen scaffold sequences, despite the low sequence homology between MAD2 and the other MAD nucleases.

This workflow could also be used to identify or test PAM sequences compatible with a given MAD nuclease. Another method for identifying a PAM site is described in the next example.

In general, for the assays described, transformations were carried out as follows. E. coli strains expressing the codon optimized MAD nucleases were grown overnight. Saturated cultures were diluted 1/100, grown to an OD600 of 0.6, and induced by adding arabinose at a final concentration of 0.4% and (if a temperature sensitive plasmid is used) shifting the culture to 42 degrees Celsius in a shaking water bath. Following induction, cells were chilled on ice for 15 min prior to washing thrice with ¼ the initial culture volume of 10% glycerol (for example, 50 mL washes for a 200 mL culture). Cells were resuspended in 1/100 the initial volume (for example, 2 mL for a 200 mL culture) and stored at −90 degrees Celsius until ready to use. To perform the compatibility and editing efficiency screens described here, 50 ng of editing cassette was transformed into cell aliquots by electroporation. Following electroporation, the cells were recovered in LB for 3 hours and 100 μL of cells were plated on MacConkey plates containing 1% galactose.

Editing efficiencies were determined by dividing the number of white colonies (edited cells) by the total number of white and red colonies (edited and non-edited cells).
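The plate-based calculation described above may be sketched, for illustration only, as follows. The function name editing_efficiency and the example colony counts are hypothetical and are not data from the experiments described herein.

```python
def editing_efficiency(white_colonies: int, red_colonies: int) -> float:
    """Fraction of edited cells: white / (white + red).

    White colonies are scored as edited (galK disrupted) and red colonies as
    non-edited, following the plate-based screen described in the text.
    """
    if white_colonies < 0 or red_colonies < 0:
        raise ValueError("colony counts must be non-negative")
    total = white_colonies + red_colonies
    if total == 0:
        raise ValueError("no colonies were counted")
    return white_colonies / total


if __name__ == "__main__":
    # Hypothetical plate with 98 white and 2 red colonies.
    print(f"{editing_efficiency(98, 2):.0%}")
```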

Example 4. PAM Selection Assay

In order to generate a double strand break in a target sequence, a guide nucleic acid must hybridize to a target sequence, and the MAD nuclease must recognize a PAM sequence adjacent to the target sequence. If the guide nucleic acid hybridizes to the target sequence, but the MAD nuclease does not recognize a PAM site, then cleavage does not occur.

A PAM is MAD nuclease-specific and not all MAD nucleases necessarily recognize the same PAM. In order to assess the PAM site requirements for the MAD nucleases, an assay as depicted in FIGS. 6A-6C was performed.

FIG. 6A depicts a MAD nuclease expression vector as described elsewhere, which also contains a chloramphenicol resistance gene and the lambda RED recombination system.

FIG. 6B depicts a self-targeting editing cassette. The guide nucleic acid is designed to target the target sequence, which is contained on the same nucleic acid molecule. The target sequence is flanked by random nucleotides, depicted by N4, meaning four random nucleotides on either end of the target sequence. It should be understood that any number of random nucleotides could also be used (for example, 3, 5, 6, 7, 8, etc.). The random nucleotides serve as a library of potential PAMs.
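For illustration, the size and content of such a random flank library may be enumerated as in the following sketch. The function name random_flank_library is hypothetical, and the sketch assumes each random position is drawn from the four standard DNA bases, so that an N4 flank corresponds to 4^4 = 256 candidate sequences per side of the target sequence.

```python
from itertools import product


def random_flank_library(n: int = 4, alphabet: str = "ACGT") -> list[str]:
    """Enumerate every possible flank of n random nucleotides."""
    return ["".join(bases) for bases in product(alphabet, repeat=n)]


if __name__ == "__main__":
    flanks = random_flank_library(4)
    # Number of distinct N4 flanks on one side of the target sequence.
    print(len(flanks))
```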

FIG. 6C depicts the experimental design. Basically, the MAD nuclease expression vector and editing cassette comprising the random PAM sites were transformed into a host cell. If a functional targetable nuclease complex was formed and the MAD nuclease recognized a PAM site, then the editing cassette vector was cleaved, which led to cell death. If a functional targetable complex was not formed or if the MAD nuclease did not recognize the PAM, then the target sequence was not cleaved and the cell survived. Next generation sequencing (NGS) was then used to sequence the starting and final cell populations in order to determine what PAM sites were recognized by a given MAD nuclease. These recognized PAM sites were then used to determine a consensus or non-consensus PAM for a given MAD nuclease.
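One way the comparison of starting and final populations could be scored is sketched below, for illustration only. The function name pam_depletion, the pseudocount, and the log2 ratio of read frequencies are assumptions of this sketch and are not the analysis actually used; a candidate PAM with a strongly negative score is depleted from the surviving population, consistent with recognition by the nuclease.

```python
import math
from collections import Counter


def pam_depletion(start_reads, final_reads, pseudocount=1.0):
    """Return log2(final frequency / starting frequency) per candidate PAM.

    start_reads and final_reads are iterables of PAM strings extracted from
    sequencing reads of the starting and final populations. The pseudocount
    is an assumption that avoids division by zero for fully depleted PAMs.
    """
    start = Counter(start_reads)
    final = Counter(final_reads)
    pams = set(start) | set(final)
    n_start = sum(start.values()) + pseudocount * len(pams)
    n_final = sum(final.values()) + pseudocount * len(pams)
    scores = {}
    for pam in pams:
        f_start = (start[pam] + pseudocount) / n_start
        f_final = (final[pam] + pseudocount) / n_final
        scores[pam] = math.log2(f_final / f_start)
    return scores


if __name__ == "__main__":
    # Hypothetical read counts, not experimental data.
    before = ["TTTA"] * 50 + ["GGGC"] * 50
    after = ["TTTA"] * 2 + ["GGGC"] * 98
    for pam, score in sorted(pam_depletion(before, after).items()):
        print(pam, round(score, 2))
```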

The consensus PAM for MAD1-MAD8 and MAD10-MAD12 was determined to be TTTN. The consensus PAM for MAD9 was determined to be NNG. The consensus PAM for MAD13-MAD15 was determined to be TTN. The consensus PAM for MAD16-MAD18 was determined to be TA. The consensus PAM for MAD19-MAD20 was determined to be TTCN.
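The consensus PAMs listed above may be checked against a candidate sequence as in the following illustrative sketch. The names CONSENSUS_PAM and matches_pam are hypothetical, and the sketch assumes that N denotes any of A, C, G or T; it does not address on which side of the target sequence the PAM lies.

```python
import re


def _mads(first: int, last: int) -> list[str]:
    """Helper producing names such as MAD1 ... MAD8 (inclusive range)."""
    return [f"MAD{i}" for i in range(first, last + 1)]


# Consensus PAMs as stated in the text above.
CONSENSUS_PAM = {}
for names, pam in [
    (_mads(1, 8) + _mads(10, 12), "TTTN"),
    (_mads(9, 9), "NNG"),
    (_mads(13, 15), "TTN"),
    (_mads(16, 18), "TA"),
    (_mads(19, 20), "TTCN"),
]:
    for name in names:
        CONSENSUS_PAM[name] = pam


def matches_pam(consensus: str, candidate: str) -> bool:
    """True if candidate fits the consensus, treating N as any of A/C/G/T."""
    pattern = "".join("[ACGT]" if base == "N" else base for base in consensus)
    return re.fullmatch(pattern, candidate.upper()) is not None


if __name__ == "__main__":
    print(matches_pam(CONSENSUS_PAM["MAD7"], "TTTG"))
    print(matches_pam(CONSENSUS_PAM["MAD19"], "TTTA"))
```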

Example 5. Testing Heterologous Guide Nucleic Acids

Editing efficiencies were tested for MAD1, MAD2, MAD4, and MAD7 and are depicted in FIG. 7A and FIG. 7B. Experiment details and editing efficiencies are summarized in Table 3. Editing efficiency was determined by dividing the number of edited cells by the total number of recovered cells. Various editing cassettes targeting the galK gene were used to allow screening of edited cells. The guide nucleic acids encoded on the editing cassette contained a guide sequence targeting the galK gene and one of various scaffold sequences in order to test the compatibility of the indicated MAD nuclease with the indicated scaffold sequence, as summarized in Table 3.

Compatible MAD nuclease and guide nucleic acid pairs (comprising the indicated scaffold sequences) were observed to have editing efficiencies of 75-100%. MAD2 had an editing efficiency of 75-100% and MAD7 had an editing efficiency of 97-100%.

MAD2 combined with scaffold-1, scaffold-2, scaffold-4, or scaffold-13 in these experiments resulted in 0% editing efficiency. These data imply that MAD2 did not form a functional complex with these tested guide nucleic acids and that MAD2 is not compatible with these scaffold sequences.

MAD7 combined with scaffold-1, scaffold-2, scaffold-4, or scaffold-13 in these experiments resulted in 0% editing efficiency. These data imply that MAD7 did not form a functional complex with these tested guide nucleic acids and that MAD7 is not compatible with these scaffold sequences.

For MAD1 and MAD4, all tested guide nucleic acid combinations resulted in 0% editing efficiency, implying that MAD1 and MAD4 did not form a functional complex with any of the tested guide nucleic acids. These data also imply that MAD1 and MAD4 are not compatible with the tested scaffold sequences.

Combined, these data highlight the unpredictability of finding a compatible MAD nuclease and scaffold sequence pair in order to form a functional targetable nuclease complex. Some tested MAD nucleases did not function with any tested scaffold sequence. Some tested MAD nucleases only functioned with some tested scaffold sequences and not with others.

TABLE 3
# | Nucleic acid-guided nuclease | Guide nucleic acid scaffold sequence | Editing sequence mutation | Target gene | Editing efficiency
1 | MAD1 | Scaffold-1; SEQ ID NO: 84 | L80** | galK | 0%
2 | MAD1 | Scaffold-2; SEQ ID NO: 85 | Y145** | galK | 0%
3 | MAD1 | Scaffold-4; SEQ ID NO: 87 | Y145** | galK | 0%
4 | MAD1 | Scaffold-10; SEQ ID NO: 93 | Y145** | galK | 0%
5 | MAD1 | Scaffold-11; SEQ ID NO: 94 | L80** | galK | 0%
6 | MAD1 | Scaffold-12; SEQ ID NO: 95 | L10KpnI | galK | 0%
7 | MAD1 | Scaffold-13; SEQ ID NO: 96 | Y145** | galK | 0%
8 | MAD1 | Scaffold-12; SEQ ID NO: 95 | L10KpnI | galK | 0%
9 | MAD2 | Scaffold-10; SEQ ID NO: 93 | L80** | galK | 0%
10 | MAD2 | Scaffold-10; SEQ ID NO: 93 | Y145** | galK | 100%
11 | MAD2 | Scaffold-11; SEQ ID NO: 94 | L80** | galK | 98%
12 | MAD2 | Scaffold-11; SEQ ID NO: 94 | Y145** | galK | 99%
13 | MAD2 | Scaffold-12; SEQ ID NO: 95 | Y145** | galK | 98%
14 | MAD2 | Scaffold-12; SEQ ID NO: 95 | Y145** | galK | 0%
15 | MAD2 | Scaffold-13; SEQ ID NO: 96 | Y145** | galK | 0%
16 | MAD2 | Scaffold-1; SEQ ID NO: 84 | L80** | galK | 0%
17 | MAD2 | Scaffold-2; SEQ ID NO: 85 | Y145** | galK | 0%
18 | MAD2 | Scaffold-2; SEQ ID NO: 85 | Y145** | galK | 0%
19 | MAD2 | Scaffold-4; SEQ ID NO: 87 | Y145** | galK | 0%
20 | MAD2 | Scaffold-5; SEQ ID NO: 88 | L80** | galK | 99%
21 | MAD2 | Scaffold-12; SEQ ID NO: 95 | 89** | galK | 0%
22 | MAD2 | Scaffold-12; SEQ ID NO: 95 | 70** | galK | 75%
23 | MAD2 | Scaffold-12; SEQ ID NO: 95 | L10KpnI | galK | 79%
24 | MAD4 | Scaffold-1; SEQ ID NO: 84 | L80** | galK | 0%
25 | MAD4 | Scaffold-2; SEQ ID NO: 85 | Y145** | galK | 0%
26 | MAD4 | Scaffold-4; SEQ ID NO: 87 | Y145** | galK | 0%
27 | MAD4 | Scaffold-10; SEQ ID NO: 93 | Y145** | galK | 0%
28 | MAD4 | Scaffold-11; SEQ ID NO: 94 | L80** | galK | 0%
29 | MAD4 | Scaffold-12; SEQ ID NO: 95 | L10KpnI | galK | 0%
30 | MAD4 | Scaffold-13; SEQ ID NO: 96 | Y145** | galK | 0%
31 | MAD4 | Scaffold-12; SEQ ID NO: 95 | L10KpnI | galK | 0%
32 | MAD7 | Scaffold-1; SEQ ID NO: 84 | L80** | galK | 0%
33 | MAD7 | Scaffold-2; SEQ ID NO: 85 | Y145** | galK | 0%
34 | MAD7 | Scaffold-4; SEQ ID NO: 87 | Y145** | galK | 0%
35 | MAD7 | Scaffold-10; SEQ ID NO: 93 | Y145** | galK | 100%
36 | MAD7 | Scaffold-11; SEQ ID NO: 94 | L80** | galK | 97%
37 | MAD7 | Scaffold-12; SEQ ID NO: 95 | L10KpnI | galK | 0%
38 | MAD7 | Scaffold-13; SEQ ID NO: 96 | Y145** | galK | 0%
39 | MAD7 | Scaffold-12; SEQ ID NO: 95 | L10KpnI | galK | 0%

Example 6. Assessment of MAD2 and MAD7

The ability of MAD2 and MAD7 to function with heterologous guide nucleic acids was tested using a similar experimental design as described above.

The compatibility of MAD2 with other scaffold sequences was tested and the results of an experiment are depicted in FIG. 8. The MAD nucleases, guide nucleic acid scaffold sequences, and editing sequences used in this experiment are summarized in Table 4.

The compatibility of MAD7 with other scaffold sequences was tested and the results of an experiment are depicted in FIG. 9. The MAD nucleases, guide nucleic acid scaffold sequences, and editing sequences used in this experiment are summarized in Table 5.

TABLE 4
# | Nucleic acid-guided nuclease | Guide nucleic acid scaffold sequence | Editing sequence mutation | Target gene
1 | MAD2 | Scaffold-12; SEQ ID NO: 95 | N89KpnI | galK
2 | MAD2 | Scaffold-10; SEQ ID NO: 93 | L80** | galK
3 | MAD2 | Scaffold-5; SEQ ID NO: 88 | L80** | galK
4 | MAD2 | Scaffold-12; SEQ ID NO: 95 | D70KpnI | galK
5 | MAD2 | Scaffold-12; SEQ ID NO: 95 | Y145** | galK
6 | MAD2 | Scaffold-11; SEQ ID NO: 94 | Y145** | galK
7 | MAD2 | Scaffold-10; SEQ ID NO: 93 | Y145** | galK
8 | MAD2 | Scaffold-12; SEQ ID NO: 95 | L10KpnI | galK
9 | MAD2 | Scaffold-11; SEQ ID NO: 94 | L80** | galK
10 | SpCas9 | S. pyogenes gRNA | Y145** | galK
11 | MAD2 | Scaffold-2; SEQ ID NO: 85 | Y145** | galK
12 | MAD2 | Scaffold-4; SEQ ID NO: 87 | Y145** | galK
13 | MAD2 | Scaffold-1; SEQ ID NO: 84 | L80** | galK
14 | MAD2 | Scaffold-13; SEQ ID NO: 96 | Y145** | galK

TABLE 5
# | Nucleic acid-guided nuclease | Guide nucleic acid scaffold sequence | Editing sequence mutation | Target gene
1 | MAD7 | Scaffold-1; SEQ ID NO: 84 | L80** | galK
2 | MAD7 | Scaffold-2; SEQ ID NO: 85 | Y145** | galK
3 | MAD7 | Scaffold-4; SEQ ID NO: 87 | Y145** | galK
4 | MAD7 | Scaffold-10; SEQ ID NO: 93 | Y145** | galK
5 | MAD7 | Scaffold-11; SEQ ID NO: 94 | L80** | galK

In another experiment, transformation efficiencies (FIG. 10B) were determined by calculating the total number of recovered cells compared to the starting number of cells. An example plate image is depicted in FIG. 10C. Editing efficiencies (FIG. 10A) were determined by calculating the ratio of edited colonies (white colonies, edited galK gene) to total colonies.

In this example (FIGS. 10A-10C), cells expressing galK were transformed with expression constructs expressing either MAD2 or MAD7 and a corresponding editing cassette comprising a guide nucleic acid targeting the galK gene. The guide nucleic acid comprised a guide sequence targeting the galK gene and the scaffold-12 sequence (SEQ ID NO: 95).

In the depicted example, MAD2 and MAD7 had a lower transformation efficiency compared to S. pyogenes Cas9, though the editing efficiency of MAD2 and MAD7 was slightly higher than that of S. pyogenes Cas9.

FIG. 11 depicts the sequencing results from select colonies recovered from the assay described above. The target sequence was in the galK coding sequence (CDS). The TTTN PAM is shown as the reverse complement (wild-type NAAA, mutated NGAA). The mutations targeted by the editing sequence are labeled as target codons. Changes compared to the wild-type sequence are highlighted. In these experiments, the scaffold-12 sequence (SEQ ID NO: 95) was used. The guide sequence of the guide nucleic acid targeted the galK gene.
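The relationship between the TTTN PAM and its depicted reverse complement may be verified with a short sketch such as the following. The function name reverse_complement is hypothetical, and N is assumed to pair with N.

```python
# Complement table for the four DNA bases plus the ambiguity code N.
_COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (N maps to N)."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


if __name__ == "__main__":
    # The consensus PAM read on the opposite strand.
    print(reverse_complement("TTTN"))
    # The mutated PAM as depicted (NGAA) read on the PAM strand.
    print(reverse_complement("NGAA"))
```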

Six of the seven depicted sequences from the MAD2 experiment contained the designed PAM mutation and designed mutations in the target codons of galK, while one sequenced colony maintained the wild-type PAM and wild-type target codons while also containing an unintended mutation upstream of the target site.

Two of the four depicted sequences from the MAD7 experiment contained the designed PAM mutation and mutated target codons. One colony comprised a wild-type sequence, while another contained a deletion of eight nucleotides upstream of the target sequence.

FIG. 12 depicts results from another experiment testing the ability to recover edited cells. In Experiment 0, the MAD2 nuclease was used with a guide nucleic acid comprising the scaffold-11 sequence and a guide sequence targeting galK. The editing cassette comprised an editing sequence designed to incorporate an L80** mutation into galK, thereby allowing screening of the edited cells. In Experiment 1, the MAD2 nuclease was used with a guide nucleic acid comprising the scaffold-12 sequence and a guide sequence targeting galK. The editing cassette comprised an editing sequence designed to incorporate an L10KpnI mutation into galK. In both experiments, a negative control plasmid containing a guide nucleic acid that is not compatible with MAD2 was included in the transformations. Following transformation, the ratio of the compatible editing cassette (those containing scaffold-11 or scaffold-12 guide nucleic acids) to the non-compatible editing cassette (negative control) was measured. The experiments were done in the presence or absence of selection. The results show that more cells containing the compatible editing cassette were recovered compared to the non-compatible editing cassette, and this result is magnified when selection is used.

Example 7. Guide Nucleic Acid Characterization

The sequences of scaffolds 1-8 and 10-12 (SEQ ID NO: 84-91 and 93-95) were aligned and are depicted in FIG. 13A. Nucleotides that match the consensus sequence are faded, while those diverging from the consensus sequence are visible. The predicted pseudoknot region is indicated. Without being bound by theory, the region 5′ of the pseudoknot may influence binding and/or kinetics of the nucleic acid-guided nuclease. As is shown in FIG. 13A, in general, there appears to be less variability in the pseudoknot region (e.g., SEQ ID NO: 172-181) as compared to the sequence outside of the pseudoknot region.

FIG. 13B shows a preliminary model of MAD2 and MAD12 complexed with a guide nucleic acid (in this example, a guide RNA) and target sequence (DNA).

Example 8. Editing Efficiency of the MAD Nucleases

A plate-based editing efficiency assay and a molecular editing efficiency assay were used to test editing efficiency of various MAD nuclease and guide nucleic acid combinations.

FIG. 15 depicts quantification of the data obtained using the molecular editing efficiency assay using MAD2 nuclease with a guide nucleic acid comprising scaffold-12 and a guide sequence targeting galK. The indicated mutations were incorporated into galK using corresponding editing cassettes containing the mutation. FIG. 16 shows the comparison of the editing efficiencies determined by the plate-based assay using white and red colonies as described previously, and the molecular editing efficiency assay. As shown in FIG. 16, the editing efficiencies as determined by the two separate assays are consistent.

Example 9. Trackable Editing

Genetic edits can be tracked by the use of a barcode. A barcode can be incorporated into or near the edit site as described in the present specification. When multiple rounds of engineering are being performed, with a different edit being made in each round, it may be beneficial to insert a barcode in a common region during each round of engineering. This way, one could sequence a single site and get the sequences of all of the barcodes from each round without the need to sequence each edited site individually. FIGS. 17A-17C, 18, and 19 depict examples of such trackable engineering workflows.

As depicted in FIG. 17A, a cell expressing a MAD nuclease is transformed with a plasmid containing an editing cassette and a recording cassette. The editing cassette contains a PAM mutation and a gene edit. The recorder cassette comprises a barcode, in this case 15N. The editing cassette and the recording cassette each comprise a guide nucleic acid to a distinct target sequence. Within a library of such plasmids, the recorder cassette for each round can contain the same guide nucleic acid, such that the first round barcode is inserted into the same location across all variants, regardless of what editing cassette and corresponding gene edit is used. The correlation between the barcode and the editing cassette is determined beforehand, however, such that the edit can be identified by sequencing the barcode. FIG. 17B shows an example of a recording cassette designed to delete a PAM site while incorporating a 15N barcode. The deleted PAM is used to enrich for edited cells, since cells with a mutated PAM escape cell death while cells containing a wild-type PAM sequence are killed. FIG. 17C depicts how sequencing the barcode region can be used to identify which edit is comprised within each cell.
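The barcode-to-edit lookup described above can be illustrated with a minimal sketch. It assumes a 15-nucleotide barcode located between two known flanking sequences in a sequencing read, and a predetermined table relating each barcode to its editing cassette. The flanking sequences, barcode, edit label, and function names are hypothetical placeholders.

```python
import re

BARCODE_LEN = 15  # corresponds to the 15N barcode in this example


def extract_barcode(read, left_flank, right_flank, length=BARCODE_LEN):
    """Return the barcode found between the two flanks, or None if absent."""
    pattern = (re.escape(left_flank)
               + "([ACGT]{%d})" % length
               + re.escape(right_flank))
    match = re.search(pattern, read)
    return match.group(1) if match else None


def identify_edit(read, barcode_to_edit, left_flank, right_flank):
    """Look up the edit associated with the barcode in a read (None if unknown)."""
    barcode = extract_barcode(read, left_flank, right_flank)
    if barcode is None:
        return None
    return barcode_to_edit.get(barcode)


# Hypothetical flanks, barcode, and edit label for illustration only.
LEFT, RIGHT = "GGATCC", "AAGCTT"
table = {"ACGTACGTACGTACG": "edit_A"}
read = "TT" + LEFT + "ACGTACGTACGTACG" + RIGHT + "CC"
print(identify_edit(read, table, LEFT, RIGHT))
```

A read lacking the flanks, or carrying a barcode absent from the table, returns None in this sketch, which would correspond to an unedited cell or an unrecognized barcode.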

A similar approach is depicted in FIG. 18. In this case, the recorder cassette from each round is designed to target a sequence adjacent to the previous round, and each time, a new PAM site is deleted by the recorder cassette. The result is a barcode array with the barcodes from each round that can be sequenced to confirm that each round of engineering took place and to determine which combination of mutations is contained in the cell, and in which order the mutations were made. Each successive recorder cassette can be designed to be homologous on one end to the region comprising the mutated PAM from the previous round, which could increase the efficiency of getting fully edited cells at the end of the experiment. In other examples, the recorder cassette is designed to target a unique landing site that was incorporated by the previous recorder cassette. This increases the efficiency of recovering cells containing all of the desired mutations, since the subsequent recorder cassette and barcode can only target a cell that has successfully completed the previous round of engineering.
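Decoding such a barcode array can be sketched as follows. The sketch assumes, purely for illustration, that the barcodes in the sequenced array are separated by a fixed spacer that does not occur within any barcode, and that a separate barcode-to-edit table exists for each round. The spacer, barcodes, edit labels, and function name are hypothetical.

```python
def decode_barcode_array(array_seq, spacer, round_tables, barcode_len=15):
    """Return the edits in round order, raising if any round is missing or unrecognized."""
    barcodes = array_seq.split(spacer)
    if len(barcodes) != len(round_tables):
        raise ValueError(
            "expected %d barcodes, found %d" % (len(round_tables), len(barcodes))
        )
    edits = []
    for round_no, (barcode, table) in enumerate(zip(barcodes, round_tables), start=1):
        if len(barcode) != barcode_len or barcode not in table:
            raise ValueError("round %d: unrecognized barcode %r" % (round_no, barcode))
        edits.append(table[barcode])
    return edits


# Hypothetical two-round example.
SPACER = "GGGCCC"
round1 = {"ACGTACGTACGTACG": "edit_A"}
round2 = {"TTGATTGATTGATTG": "edit_B"}
array = "ACGTACGTACGTACG" + SPACER + "TTGATTGATTGATTG"
print(decode_barcode_array(array, SPACER, [round1, round2]))
```

Because the function raises an error when a barcode is missing or unrecognized, a successful return in this sketch indicates both that every round is represented in the array and the order in which the corresponding edits were recorded.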

FIG. 19 depicts another approach that allows the recycling of selectable markers or otherwise cures the cell of the plasmid from the previous round of engineering. In this case, the transformed plasmid contains a guide nucleic acid designed to target a selectable marker or other unique sequence in the plasmid from the previous round of engineering.

TABLE 6 SEQUENCE LISTING SEQ ID NO: Sequence SEQMGKMYYLGLDIGTNSVGYAVTDPSYHLLKFKGEPMWGAHVFAAGNQSAERRSFRTSRRRLDRRQQRVKLVIDQEIFAPVISPIDPRFFIRLHESALWRDDVAETDKHIFFNDPTYTDKEYYSDYPTIHHLIVDLMESSEKHDPRLVYNO: 1LAVAWLVAHRGHFLNEVDKDNIGDVLSFDAFYPEFLAFLSDNGVSPWVCESKALQATLLSRNSVNDKYKALKSLIFGSQKPEDNFDANISEDGLIQLLAGKKVKVNKLFPQESNDASFTLNDKEDAIEEILGTLTPDECEWIAHIRRLFDWAIMKHALKDGRTISESKVKLYEQHHHDLTQLKYFVKTYLAKEYDDIFRNVDSETTKNYVAYSYHVKEVKGTLPKNKATQEEFCKYVLGKVKNIECSEADKVDFDEMIQRLTDNSFMPKQVSGENRVIPYQLYYYELKTILNKAASYLPFLTQCGKDAISNQDKLLSIMTFRIPYFVGPLRKDNSEHAWLERKAGKIYPWNFNDKVDLDKSEEAFIRRMTNTCTYYPGEDVLPLDSLIYEKFMILNEINNIRIDGYPISVDVKQQVFGLFEKKRRVTVKDIQNLLLSLGALDKHGKLTGIDTTIHSNYNTYHHFKSLMERGVLTRDDVERIVERMTYSDDTKRVRLWLNNNYGTLTADDVKHISRLRKHDFGRLSKMFLTGLKGVHKETGERASILDFMWNTNDNLMQLLSECYTFSDEITKLQEAYYAKAQLSLNDFLDSMYISNAVKRPIYRTLAVVNDIRKACGTAPKRIFIEMARDGESKKKRSVTRREQIKNLYRSIRKDFQQEVDFLEKILENKSDGQLQSDALYLYFAQLGRDMYTGDPIKLEHIKDQSFYNIDHIYPQSMVKDDSLDNKVLVQSEINGEKSSRYPLDAAIRNKMKPLWDAYYNHGLISLKKYQRLTRSTPFTDDEKWDFINRQLVETRQSTKALAILLKRKFPDIEIVYSKAGLSSDFRHEFGLVKSRNINDLHHAKDAFLAIVTGNVYHERFNRRWFMVNQPYSVKTKTLFTHSIKNGNFVAWNGEEDLGRIVKMLKQNKNTIHFTRFSFDRKEGLFDIQPLKASTGLVPRKAGLDVVKYGGYDKSTAAYYLLVRFTLEDKKTQHKLMMIPVEGLYKARIDHDKEFLTDYAQTTISEILQKDKQKVINIMFPMGTRHIKLNSMISIDGFYLSIGGKSSKGKSVLCHAMVPLIVPHKIECYIKAMESFARKFKENNKLRIVEKFDKITVEDNLNLYELFLQKLQHNPYNKFFSTQFDVLTNGRSTFTKLSPEEQVQTLLNILSIFKTCRSSGCDLKSINGSAQAARIMISADLTGLSKKYSDIRLVEQSASGLFVSKSQNLLEYL* SEQMSSLTKFTNKYSKQLTIKNELIPVGKTLENIKENGLIDGDEQLNENYQKAKIIVDDFLRDFINKALNNTQIGNWIDRELADALNKEDEDNIEKLQDKIRGIIVSKFETFDLFSSYSIKKDEKIIDDDNDVEEEELDLGKKTSSFKYIFKKNNO: 
2LFKLVLPSYLKTTNQDKLKIISSFDNFSTYFRGFFENRKNIFTKKPISTSIAYRIVHDNFPKFLDNIRCFNVWQTECPQLIVKADNYLKSKNVIAKDKSLANYFTVGAYDYFLSQNGIDFYNNIIGGLPAFAGHEKIQGLNEFINQECQKDSELKSKLKNRHAFKMAVLFKQILSDREKSFVIDEFESDAQVIDAVKNFYAEQCKDNNVIFNLLNLIKNIAFLSDDELDGIFIEGKYLSSVSQKLYSDWSKLRNDIEDSANSKQGNKELAKKIKTNKGDVEKAISKYEFSLSELNSIVHDNTKFSDLLSCTLHKVASEKLVKVNEGDWPKHLKNNEEKQKIKEPLDALLEIYNTLLIFNCKSFNKNGNFYVDYDRCINELSSVVYLYNKTRNYCTKKPYNTDKFKLNFNSPQLGEGFSKSKENDCLTLLFKKDDNYYVGIIRKGAKINFDDTQAIADNTDNCIFKMNYFLLKDAKKFIPKCSIQLKEVKAHFKKSEDDYILSDKEKFASPLVIKKSTFLLATAHVKGKKGNIKKFQKEYSKENPTEYRNSLNEWIAFCKEFLKTYKAATIFDITTLKKAEEYADIVEFYKDVDNLCYKLEFCPIKTSFIENLIDNGDLYLFRINNKDFSSKSTGTKNLHTLYLQAIFDERNLNNPTIMLNGGAELFYRKESIEQKNRITHKAGSILVNKVCKDGTSLDDKIRNEIYQYENKFIDTLSDEAKKVLPNVIKKEATHDITKDKRFTSDKFFFHCPLTINYKEGDTKQFNNEVLSFLRGNPDINIIGIDRGERNLIYVTVINQKGEILDSVSFNTVTNKSSKIEQTVDYEEKLAVREKERIEAKRSWDSISKIATLKEGYLSAIVHEICLLMIKHNAIVVLENLNAGFKRIRGGLSEKSVYQKFEKMLINKLNYFVSKKESDWNKPSGLLNGLQLSDQFESFEKLGIQSGFIFYVPAAYTSKIDPTTGFANVLNLSKVRNVDAIKSFFSNFNEISYSKKEALFKFSFDLDSLSKKGFSSFVKFSKSKWNVYTFGERIIKPKNKQGYREDKRINLTFEMKKLLNEYKVSFDLENNLIPNLTSANLKDTFWKELFFIFKTTLQLRNSVTNGKEDVLISPVKNAKGEFFVSGTHNKTLPQDCDANGAYHIALKGLMILERNNLVREEKDTKKIMAISNVDWFEYVQKRRGVL* SEQMNNYDEFTKLYPIQKTIRFELKPQGRTMEHLETFNFFEEDRDRAEKYKILKEAIDEYHKKFIDEHLTNMSLDWIDNSLKQISEKYYKSREEKDKKVFLSEQKRMRQEIVSEFKKDDRFKDLFSKKLFSELLKEEIYKKGNHQEIDALKNO: 
3SFDKFSGYFIGLHENRKNMYSDGDEITAISNRIVNENFPKFLDNLQKYQEARKKYPEWIIKAESALVAHNIKMDEVFSLEYFNKVLNQEGIQRYNLALGGYVTKSGEKMMGLNDALNLAHQSEKSSKGRIHMTPLFKQILSEKESFSYIPDVFTEDSQLLPSIGGFFAQIENDKDGNIFDRALELISSYAEYDTERIYIRQADINRVSNVIFGEWGTLGGLMREYKADSINDINLERTCKKVDKWLDSKEFALSDVLEAIKRTGNNDAFNEYISKMRTAREKIDAARKEMKFISEKISGDEESIHIIKTLLDSVQQFLHFFNLFKARQDIPLDGAFYAEFDEVHSKLFAIVPLYNKVRNYLTKNNLNTKKIKLNFKNPTLANGWDQNKVYDYASLIFLRDGNYYLGIINPKRKKNIKFEQGSGNGPFYRKMVYKQIPGPNKNLPRVFLTSTKGKKEYKPSKEIIEGYEADKHIRGDKFDLDFCHKLIDFFKESIEKHKDWSKFNFYFSPTESYGDISEFYLDVEKQGYRMHFENISAETIDEYVEKGDLFLFQIYNKDFVKAATGKKDMHTIYWNAAFSPENLQDVVVKLNGEAELFYRDKSDIKEIVHREGEILVNRTYNGRTPVPDKIHKKLTDYHNGRTKDLGEAKEYLDKVRYFKAHYDITKDRRYLNDKIYFHVPLTLNFKANGKKNLNKMVIEKFLSDEKAHIIGIDRGERNLLYYSIIDRSGKIIDQQSLNVIDGFDYREKLNQREIEMKDARQSWNAIGKIKDLKEGYLSKAVHEITKMAIQYNAIVVMEELNYGFKRGRFKVEKQIYQKFENMLIDKMNYLVFKDAPDESPGGVLNAYQLTNPLESFAKLGKQTGILFYVPAAYTSKIDPTTGFVNLFNTSSKTNAQERKEFLQKFESISYSAKDGGIFAFAFDYRKFGTSKTDHKNVWTAYTNGERMRYIKEKKRNELFDPSKEIKEALTSSGIKYDGGQNILPDILRSNNNGLIYTMYSSFIAAIQMRVYDGKEDYIISPIKNSKGEFFRTDPKRRELPIDADANGAYNIALRGELTMRAIAEKFDPDSEKMAKLELKHKDWFEFMQTRGD*SEQMTKTFDSEFFNLYSLQKTVRFELKPVGETASFVEDFKNEGLKRVVSEDERRAVDYQKVKEIIDDYHRDFIEESIDLNYFPEQVSKDALEQAFHLYQKLKAAKVEEREKALKEWEALQKKLREKVVKCFSDSNKARFSRIDKKELIKNO: 
4EDLINWLVAQNREDDIPTVETFNNFTTYFTGFHENRKNIYSKDDHATAISFRLIHENLPKFFDNVISFNKLKEGFPELKFDKVKEDLEVDYDLKHAFEIEYFVNFVTQAGIDQYNYLLGGKTLEDGTKKQGMNEQINLFKQQQTRDKARQIPKLIPLFKQILSERTESQSFIPKQFESDQELFDSLQKLHNNCQDKFTVLQQAILGLAEADLKKVFIKTSDLNALSNTIFGNYSVFSDALNLYKESLKTKKAQEAFEKLPAHSIHDLIQYLEQFNSSLDAEKQQSTDTVLNYFIKTDELYSRFIKSTSEAFTQVQPLFELEALSSKRRPPESEDEGAKGQEGFEQIKRIKAYLDTLMEAVHFAKPLYLVKGRKMIEGLDKDQSFYEAFEMAYQELESLIIPIYNKARSYLSRKPFKADKFKINFDNNTLLSGWDANKETANASILFKKDGLYYLGIMPKGKTFLFDYFVSSEDSEKLKQRRQKTAEEALAQDGESYFEKIRYKLLPGASKMLPKVFFSNKNIGFYNPSDDILRIRNTASHTKNGTPQKGHSKVEFNLNDCHKMIDFFKSSIQKHPEWGSFGFTFSDTSDFEDMSAFYREVENQGYVISFDKIKETYIQSQVEQGNLYLFQIYNKDFSPYSKGKPNLHTLYWKALFEEANLNNVVAKLNGEAEIFFRRHSIKASDKVVHPANQAIDNKNPHTEKTQSTFEYDLVKDKRYTQDKFFFHVPISLNFKAQGVSKFNDKVNGFLKGNPDVNIIGIDRGERHLLYFTVVNQKGEILVQESLNTLMSDKGHVNDYQQKLDKKEQERDAARKSWTTVENIKELKEGYLSHVVHKLAHLIIKYNAIVCLEDLNFGFKRGRFKVEKQVYQKFEKALIDKLNYLVFKEKELGEVGHYLTAYQLTAPFESFKKLGKQSGILFYVPADYTSKIDPTTGFVNFLDLRYQSVEKAKQLLSDFNAIRFNSVQNYFEFEIDYKKLTPKRKVGTQSKWVICTYGDVRYQNRRNQKGHWETEEVNVTEKLKALFASDSKTTTVIDYANDDNLIDVILEQDKASFFKELLWLLKLTMTLRHSKIKSEDDFILSPVKNEQGEFYDSRKAGEVWPKDADANGAYHIALKGLWNLQQINQWEKGKTLNLAIKNQDWFSFIQEKPYQE* SEQMHTGGLLSMDAKEFTGQYPLSKTLRFELRPIGRTWDNLEASGYLAEDRHRAECYPRAKELLDDNHRAFLNRIDVLPQIDMDWHPIAEAFCKVHKNPGNKELAQDYNLQLSKRRKEISAYLQDADGYKGLFAKPALDEAMKIAKENO: 
5NGNESDIEVLEAFNGFSVYFTGYHESRENIYSDEDMVSVAYRITEDNFPRFVSNALIFDKLNESHPDIISEVSGNLGVDDIGKYFDVSNYNNFLSQAGIDDYNHIIGGHTTEDGLIQAFNVVLNLRHQKDPGFEKIQFKQLYKQILSVRTSKSYIPKQFDNSKEMVDCICDYVSKIEKSETVERALKLVRNISSFDLRGIFVNKKNLRILSNKLIGDWDAIETALMHSSSSENDKKSVYDSAEAFTLDDIFSSVKKFSDASAEDIGNRAEDICRVISETAPFINDLRAVDLDSLNDDGYEAAVSKIRESLEPYMDLFHELEIFSVGDEFPKCAAFYSELEEVSEQLIEIIPLFNKARSFCTRKRYSTDKIKVNLKFPTLADGWDLNKERDNKAAILRKDGKYYLAILDMKKDLSSIRTSDEDESSFEKMEYKLLPSPVKMLPKIFVKSKAAKEKYGLTDRMLECYDKGMHKSGSAFDLGFCHELIDYYKRCIAEYPGWDVFDFKFRETSDYGSMKEFNEDVAGAGYYMSLRKIPCSEVYRLLDEKSIYLFQIYNKDYSENAHGNKNMHTMYWEGLFSPQNLESPVFKLSGGAELFFRKSSIPNDAKTVHPKGSVLVPRNDVNGRRIPDSIYRELTRYFNRGDCRISDEAKSYLDKVKTKKADHDIVKDRRFTVDKMMFHVPIAMNFKAISKPNLNKKVIDGIIDDQDLKIIGIDRGERNLIYVTMVDRKGNILYQDSLNILNGYDYRKALDVREYDNKEARRNWTKVEGIRKMKEGYLSLAVSKLADMIIENNAIIVMEDLNHGFKAGRSKIEKQVYQKFESMLINKLGYMVLKDKSIDQSGGALHGYQLANHVTTLASVGKQCGVIFYIPAAFTSKIDPTTGFADLFALSNVKNVASMREFFSKMKSVIYDKAEGKFAFTFDYLDYNVKSECGRTLWTVYTVGERFTYSRVNREYVRKVPTDIIYDALQKAGISVEGDLRDRIAESDGDTLKSIFYAFKYALDMRVENREEDYIQSPVKNASGEFFCSKNAGKSLPQDSDANGAYNIALKGILQLRMLSEQYDPNAESIRLPLITNKAWLTFMQSGMKTWKN* SEQMDSLKDFTNLYPVSKTLRFELKPVGKTLENIEKAGILKEDEHRAESYRRVKKIIDTYHKVFIDSSLENMAKMGIDIENEIKAMLQSFCELYKKDHRTEGEDKALDKIRAVLRGLIVGAFTGVCGRRENTVQNEKYESLFKEKLIKEILPNO: 
6DFVLSTEAESLPFSVEEATRSLKEFDSFTSYFAGFYENRKNIYSTKPQSTAIAYRLIHENLPKFIDNILVFQKIKEPIAKELEHIRADFSAGGYIKKDERLEDIFSLNYYIHVLSQAGIEKYNALIGKIVTEGDGEMKGLNEHINLYNQQRGREDRLPLFRPLYKQILSDREQLSYLPESFEKDEELLRALKEFYDHIAEDILGRTQQLMTSISEYDLSRIYVRNDSQLTDISKKMLGDWNAIYMARERAYDHEQAPKRITAKYERDRIKALKGEESISLANLNSCIAFLDNVRDCRVDTYLSTLGQKEGPHGLSNLVENVFASYHEAEQLLSFPYPEENNLIQDKDNVVLIKNLLDNISDLQRFLKPLWGMGDEPDKDERFYGEYNYIRGALDQVIPLYNKVRNYLTRKPYSTRKVKLNFGNSQLLSGWDRNKEKDNSCVILRKGQNFYLAIMNNRHKRSFENKVLPEYKEGEPYFEKMDYKFLPDPNKMLPKVFLSKKGIEIYKPSPKLLEQYGHGTHKKGDTFSMDDLHELIDFFKHSIEAHEDWKQFGFKFSDTATYENVSSFYREVEDQGYKLSFRKVSESYVYSLIDQGKLYLFQIYNKDFSPCSKGTPNLHTLYWRMLFDERNLADVIYKLDGKAEIFFREKSLKNDHPTHPAGKPIKKKSRQKKGEESLFEYDLVKDRHYTMDKFQFHVPITMNFKCSAGSKVNDMVNAHIREAKDMHVIGIDRGERNLLYICVIDSRGTILDQISLNTINDIDYHDLLESRDKDRQQERRNWQTIEGIKELKQGYLSQAVHRIAELMVAYKAVVALEDLNMGFKRGRQKVESSVYQQFEKQLIDKLNYLVDKKKRPEDIGGLLRAYQFTAPFKSFKEMGKQNGFLFYIPAWNTSNIDPTTGFVNLFHAQYENVDKAKSFFQKFDSISYNPKKDWFEFAFDYKNFTKKAEGSRSMWILCTHGSRIKNFRNSQKNGQWDSEEFALTEAFKSLFVRYEIDYTADLKTAIVDEKQKDFFVDLLKLFKLTVQMRNSWKEKDLDYLISPVAGADGRFFDTREGNKSLPKDADANGAYNIALKGLWALRQIRQTSEGGKLKLAISNKEWLQFVQERSYEKD* SEQMNNGTNNFQNFIGISSLQKTLRNALIPTETTQQFIVKNGIIKEDELRGENRQILKDIMDDYYRGFISETLSSIDDIIDDWTSLFEKMEIQLKNGDNKDTLIKEQTEYRKAIHKKFANDDRFKNMFSAKLISDILPEFVIHNNNYSASEKEENO: 
7KTQVIKLFSRFATSFKDYFKNRANCFSADDISSSSCHRIVNDNAEIFFSNALVYRRIVKSLSNDDINKISGDMKDSLKEMSLEEIYSYEKYGEFITQEGISFYNDICGKVNSFMNLYCQKNKENKNLYKLQKLHKQILCIADTSYEVPYKFESDEEVYQSVNGFLDNISSKHIVERLRKIGDNYNGYNLDKIYIVSKFYESVSQKTYRDWETINTALEIHYNNILPGNGKSKADKVKKAVKNDLQKSITEINELVSNYKLCSDDNIKAETYIHEISHILNNFEAQELKYNPEIHLVESELKASELKNVLDVIMNAFHWCSVFMTEELVDKDNNFYAELEEIYDEIYPVISLYNLVRNYVTQKPYSTKKIKLNFGIPTLADGWSKSKEYSNNAIILMRDNLYYLGIFNAKNKPDKKIIEGNTSENKGDYKKMIYNLLPGPNKMIPKVFLSSKTGVETYKPSAYILEGYKQNKHIKSSKDFDITFCHDLIDYFKNCIAIHPEWKNFGFDFSDTSTYEDISGFYREVELQGYKIDWTYISEKDIDLLQEKGQLYLFQIYNKDFSKKSTGNDNLHTMYLKNLFSEENLKDIVLKLNGEAEIFFRKSSIKNPIIHKKGSILVNRTYEAEEKDQFGNIQIVRKNIPENIYQELYKYFNDKSDKELSDEAAKLKNVVGHHEAATNIVKDYRYTYDKYFLHMPITINFKANKTGFINDRILQYIAKEKDLHVIGIDRGERNLIYVSVIDTCGNIVEQKSFNIVNGYDYQIKLKQQEGARQIARKEWKEIGKIKEIKEGYLSLVIHEISKMVIKYNAIIAMEDLSYGFKKGRFKVERQVYQKFETMLINKLNYLVFKDISITENGGLLKGYQLTYIPDKLKNVGHQCGCIFYVPAAYTSKIDPTTGFVNIFKFKDLTVDAKREFIKKFDSIRYDSEKNLFCFTFDYNNFITQNTVMSKSSWSVYTYGVRIKRRFVNGRFSNESDTIDITKDMEKTLEMTDINWRDGHDLRQDIIDYEIVQHIFEIFRLTVQMRNSLSELEDRDYDRLISPVLNENNIFYDSAKAGDALPKDADANGAYCIALKGLYEIKQITENWKEDGKFSRDKLKISNKDWFDFIQNKRYL* SEQMTNKFTNQYSLSKTLRFELIPQGKTLEFIQEKGLLSQDKQRAESYQEMKKTIDKFHKYFIDLALSNAKLTHLEIDTYLELYNKSAETKKEQKFKDDLKKVQDNLRKEIVKSFSDGDAKSIFAILDKKELITVELEKWFENNEQKDIYFNO: 
8DEKFKTFTTYFTGFHQNRKNMYSVEPNSTAIAYRLIHENLPKFLENAKAFEKIKQVESLQVNFRELMGEFGDEGLIFVNELEEMFQINYYNDVLSQNGITIYNSIISGFTKNDIKYKGLNEYINNYNQTKDKKDRLPKLKQLYKQILSDRISLSFLPDAFTDGKQVLKAIFDFYKINLLSYTIEGQEESQNLLLLIRQTIENLSSFDTQKIYLKNDTHLTTISQQVFGDFSVFSTALNYWYETKVNPKFETEYSKANEKKREILDKAKAVFTKQDYFSIAFLQEVLSEYILTLDHTSDIVKKHSSNCIADYFKNHFVAKKENETDKTFDFIANITAKYQCIQGILENADQYEDELKQDQKLIDNLKFFLDAILELLHFIKPLHLKSESITEKDTAFYDVFENYYEALSLLTPLYNMVRNYVTQKPYSTEKIKLNFENAQLLNGWDANKEGDYLTTILKKDGNYFLAIMDKKHNKAFQKFPEGKENYEKMVYKLLPGVNKMLPKVFFSNKNIAYFNPSKELLENYKKETHKKGDTFNLEHCHTLIDFFKDSLNKHEDWKYFDFQFSETKSYQDLSGFYREVEHQGYKINFKNIDSEYIDGLVNEGKLFLFQIYSKDFSPFSKGKPNMHTLYWKALFEEQNLQNVIYKLNGQAEIFFRKASIKPKNIILHKKKIKIAKKHFIDKKTKTSEIVPVQTIKNLNMYYQGKISEKELTQDDLRYIDNFSIFNEKNKTIDIIKDKRFTVDKFQFHVPITMNFKATGGSYINQTVLEYLQNNPEVKIIGLDRGERHLVYLTLIDQQGNILKQESLNTITDSKISTPYHKLLDNKENERDLARKNWGTVENIKELKEGYISQVVHKIATLMLEENAIVVMEDLNFGFKRGRFKVEKQIYQKLEKMLIDKLNYLVLKDKQPQELGGLYNALQLTNKFESFQKMGKQSGFLFYVPAWNTSKIDPTTGFVNYFYTKYENVDKAKAFFEKFEAIRFNAEKKYFEFEVKKYSDFNPKAEGTQQAWTICTYGERIETKRQKDQNNKFVSTPINLTEKIEDFLGKNQIVYGDGNCIKSQIASKDDKAFFETLLYWFKMTLQMRNSETRTDIDYLISPVMNDNGTFYNSRDYEKLENPTLPKDADANGAYHIAKKGLMLLNKIDQADLTKKVDLSISNRDWLQFVQKNK* SEQMEQEYYLGLDMGTGSVGWAVTDSEYHVLRKHGKALWGVRLFESASTAEERRMFRTSRRRLDRRNWRIEILIDQEIFAEEISKKDPGFFLRMKESKYYPEDKRDINGNCPELPYALFVDDDFTDKDYHKKFPTIYHLRKMLMNTEENO: 
9TPDIRLVYLAIHHMMKHRGHFLLSGDINEIKEFGTTFSKLLENIKNEELDWNLELGKEEYAVVESILKDNMLNRSTKKTRLIKALKAKSICEKAVLNLLAGGTVKLSDIFGLEELNETERPKISFADNGYDDYIGEVENELGEQFYIIETAKAVYDWAVLVEILGKYTSISEAKVATYEKHKSDLQFLKKIVRKYLTKEEYKDIFVSTSDKLKNYSAYIGMTKINGKKVDLQSKRCSKEEFYDFIKKNVLKKLEGQPEYEYLKEELERETFLPKQVNRDNGVIPYQIHLYELKKILGNLRDKIDLIKENEDKLVQLFEFRIPYYVGPLNKIDDGKEGKFTWAVRKSNEKIYPWNFENVVDIEASAEKFIRRMTNKCTYLMGEDVLPKDSLLYSKYMVLNELNNVKLDGEKLSVELKQRLYTDVFCKYRKVTVKKIKNYLKCEGIISGNVEITGIDGDFKASLTAYHDFKEILTGTELAKKDKENIITNIVLFGDDKKLLKKRLNRLYPQITPNQLKKICALSYTGWGRFSKKFLEEITAPDPETGEVWNIITALWESNNNLMQLLSNEYRFMEEVETYNMGKQTKTLSYETVENMYVSPSVKRQIWQTLKIVKELEKVMKESPKRVFIEMAREKQESKRTESRKKQLIDLYKACKNEEKDWVKELGDQEEQKLRSDKLYLYYTQKGRCMYSGEVIELKDLWDNTKYDIDHIYPQSKTMDDSLNNRVLVKKKYNATKSDKYPLNENIRHERKGFWKSLLDGGFISKEKYERLIRNTELSPEELAGFIERQIVETRQSTKAVAEILKQVFPESEIVYVKAGTVSRFRKDFELLKVREVNDLHHAKDAYLNIVVGNSYYVKFTKNASWFIKENPGRTYNLKKMFTSGWNIERNGEVAWEVGKKGTIVTVKQIMNKNNILVTRQVHEAKGGLFDQQIMKKGKGQIAIKETDERLASIEKYGGYNKAAGAYFMLVESKDKKGKTIRTIEFIPLYLKNKIESDESIALNFLEKGRGLKEPKILLKKIKIDTLFDVDGFKMWLSGRTGDRLLFKCANQLILDEKIIVTMKKIVKFIQRRQENRELKLSDKDGIDNEVLMEIYNTFVDKLENTVYRIRLSEQAKTLIDKQKEFERLSLEDKSSTLFEILHIFQCQSSAANLKMIGGPGKAGILVMNNNISKCNKISIINQSPTGIFENEIDLLK 
SEQMNKFENFTGLYPISKTLRFELIPQGKTLEYIEKSEILENDNYRAEKYEEVKDIIDGYHKWFINETLHDLHINWSEIDLKVALENNRIEKSDASKKELQRVQKIKREEIYNAFIEHEAFQYLFKENLLSDLLPIQIEQSEDLDAEKKKQAVENO:TFNRFSTYFTGFHENRKNIYSKEGISTSVTYRIVHDNFPKFLENMKVFEILRNECPEVISDTANELAPFIDGVRIE10DIFLIDFFNSTFSQNGIDYYNRILGGVTTETGEKYRGINEFTNLYRQQHPEFGKSKKATKMVVLFKQILSDRDTLSFIPEMFGNDKQVQNSIQLFYNREISQFENEGVKTDVCTALATLTSKIAEFDTEKIYIQQPELPNVSQRLFGSWNELNACLFKYAELKFGTAEKVANRKKIDKWLKSDLFSFTELNKALEFSGKDERIENYFSETGIFAQLVKTGFDEAQSILETEYTSEVHLKDQQTDIEKIKTFLDALQNLMHLLKSLCVSEEADRDAAFYNEFDMLYNQLKLVVPLYNKVRNYITQKLFRSDKIKIYFENKGQFLGGWVDSQTENSDNGTQAGGYIFRKENVINEYDYYLGICSDPKLFRRTTIVSENDRSSFERLDYYQLKTASVYGNSYCGKHPYTEDKNELVNSIDRFVHLSGNNILIEKIAKDKVKSNPTTNTPSGYLNFIHREAPNTYECLLQDENFVSLNQRVVSALKATLATLVRVPKALVYAKKDYHLFSEIINDIDELSYEKAFSYFPVSQTEFENSSNRTIKPLLLFKISNKDLSFAENFEKGNRQKIGKKNLHTLYFEALMKGNQDTIDIGTGMVFHRVKSLNYNEKTLKYGHHSTQLNEKFSYPIIKDKRFASDKFLFHLSTEINYKEKRKPLNNSIIEFLTNNPDINIIGLDRGERHLIYLTLINQKGEILRQKTFNIVGNTNYHEKLNQREKERDNARKSWATIGKIKELKEGFLSLVIHEIAKIMVENNAIVVLEDLNFGFKRGRFKVEKQIYQKFEKMLIDKLNYLVFKDKKANEAGGVLKGYQLAEKFESFQKMGKQSGFLFYVPAAYTSKIDPTTGFVNMLNLNYTNMKDAQTLLSGMDKISFNADANYFEFELDYEKFKTNQTDHTNKWTICTVGEKRFTYNSATKETTTVNVTEDLKKLLDKFEVKYSNGDNIKDEICRQTDAKFFEIILWLLKLTMQMRNSNTKTEEDFILSPVKNSNGEFFRSNDDANGIWPADADANGAYHIALKGLYLVKECFNKNEKSLKIEHKNWFKFAQTRFNGSLTKNG* 
SEQMENFKNLYPINKTLRFELRPYGKTLENFKKSGLLEKDAFKANSRRSMQAIIDEKFKETIEERLKYTEFSECDLGIDNMTSKDKKITDKAATNLKKQVILSFDDEIFNNYLKPDKNIDALFKNDPSNPVISTFKGFTTYFVNFFEIRKHIFKNO:GESSGSMAYRIIDENLTTYLNNIEKIKKLPEELKSQLEGIDQIDKLNNYNEFITQSGITHYNEIIGGISKSENVKIQ11GINEGINLYCQKNKVKLPRLTPLYKMILSDRVSNSFVLDTIENDTELIEMISDLINKTEISQDVIMSDIQNIFIKYKQLGNLPGISYSSIVNAICSDYDNNFGDGKRKKSYENDRKKHLETNVYSINYISELLTDTDVSSNIKMRYKELEQNYQVCKENFNATNWMNIKNIKQSEKTNLIKDLLDILKSIQRFYDLFDIVDEDKNPSAEFYTWLSKNAEKLDFEFNSVYNKSRNYLTRKQYSDKKIKLNFDSPTLAKGWDANKEIDNSTIIMRKFNNDRGDYDYFLGIWNKSTPANEKIIPLEDNGLFEKMQYKLYPDPSKMLPKQFLSKIWKAKHPTTPEFDKKYKEGRHKKGPDFEKEFLHELIDCFKHGLVNHDEKYQDVFGFNLRNTEDYNSYTEFLEDVERCNYNLSFNKIADTSNLINDGKLYVFQIWSKDFSIDSKGTKNLNTIYFESLFSEENMIEKMFKLSGEAEIFYRPASLNYCEDIIKKGHHHAELKDKFDYPIIKDKRYSQDKFFFHVPMVINYKSEKLNSKSLNNRTNENLGQFTHIIGIDRGERHLIYLTVVDVSTGEIVEQKHLDEIINTDTKGVEHKTHYLNKLEEKSKTRDNERKSWEAIETIKELKEGYISHVINEIQKLQEKYNALIVMENLNYGFKNSRIKVEKQVYQKFETALIKKFNYIIDKKDPETYIHGYQLTNPITTLDKIGNQSGIVLYIPAWNTSKIDPVTGFVNLLYADDLKYKNQEQAKSFIQKIDNIYFENGEFKFDIDFSKWNNRYSISKTKWTLTSYGTRIQTFRNPQKNNKWDSAEYDLTEEFKLILNIDGTLKSQDVETYKKFMSLFKLMLQLRNSVTGTDIDYMISPVTDKTGTHFDSRENIKNLPADADANGAYNIARKGIMAIENIMNGISDPLKISNEDYLKYIQNQQE 
SEQMTQFEGFTNLYQVSKTLRFELIPQGKTLKHIQEQGFIEEDKARNDHYKELKPIIDRIYKTYADQCLQLVQLDWIDENLSAAIDSYRKEKTEETRNALIEEQATYRNAIHDYFIGRTDNLTDAINKRHAEIYKGLFKAELFNGKVLKQLNO:GTVTTTEHENALLRSFDKFTTYFSGFYENRKNVFSAEDISTAIPHRIVQDNFPKFKENCHIFTRLITAVPSLREH12FENVKKAIGIFVSTSIEEVFSFPFYNQLLTQTQIDLYNQLLGGISREAGTEKIKGLNEVLNLAIQKNDETAHIIASLPHRFIPLFKQILSDRNTLSFILEEFKSDEEVIQSFCKYKTLLRNENVLETAEALFNELNSIDLTHIFISHKKLETISSALCDHWDTLRNALYERRISELTGKITKSAKEKVQRSLKHEDINLQEIISAAGKELSEAFKQKTSEILSHAHAALDQPLPTTLKKQEEKEILKSQLDSLLGLYHLLDWFAVDESNEVDPEFSARLTGIKLEMEPSLSFYNKARNYATKKPYSVEKFKLNFQMPTLASGWDVNKEKNNGAILFVKNGLYYLGIMPKQKGRYKALSFEPTEKTSEGFDKMYYDYFPDAAKMIPKCSTQLKAVTAHFQTHTTPILLSNNFIEPLEITKEIYDLNNPEKEPKKFQTAYAKKTGDQKGYREALCKWIDFTRDFLSKYTKTTSIDLSSLRPSSQYKDLGEYYAELNPLLYHISFQRIAEKEIMDAVETGKLYLFQIYNKDFAKGHHGKPNLHTLYWTGLFSPENLAKTSIKLNGQAELFYRPKSRMKRMAHRLGEKMLNKKLKDQKTPIPDTLYQELYDYVNHRLSHDLSDEARALLPNVITKEVSHEIIKDRRFTSDKFFFHVPITLNYQAANSPSKFNQRVNAYLKEHPETPIIGIDRGERNLIYITVIDSTGKILEQRSLNTIQQFDYQKKLDNREKERVAARQAWSVVGTIKDLKQGYLSQVIHEIVDLMIHYQAVVVLENLNFGFKSKRTGIAEKAVYQQFEKMLIDKLNCLVLKDYPAEKVGGVLNPYQLTDQFTSFAKMGTQSGFLFYVPAPYTSKIDPLTGFVDPFVWKTIKNHESRKHFLEGFDFLHYDVKTGDFILHFKMNRNLSFQRGLPGFMPAWDIVFEKNETQFDAKGTPFIAGKRIVPVIENHRFTGRYRDLYPANELIALLEEKGIVFRDGSNILPKLLENDDSHAIDTMVALIRSVLQMRNSNAATGEDYINSPVRDLNGVCFDSRFQNPEWPMDADANGAYHIALKGQLLLNHLKESKDLKLQNGISNQDWLAYIQELRN* SEQMAVKSIKVKLRLDDMPEIRAGLWKLHKEVNAGVRYYTEWLSLLRQENLYRRSPNGDGEQECDKTAEECKAIDELLERLRARQVENGHRGPAGSDDELLQLARQLYELLVPQAIGAKGDAQQIARKFLSPLADKDAVGGLGIAKANO:GNKPRWVRMREAGEPGWEEEKEKAETRKSADRTADVLRALADFGLKPLMRVYTDSEMSSVEWKPLRKGQ 
13AVRTWDRDMFQQAIERMMSWESWNQRVGQEYAKLVEQKNRFEQKNFVGQEHLVHLVNQLQQDMKEASPGLESKEQTAHYVTGRALRGSDKVFEKWGKLAPDAPFDLYDAEIKNVQRRNTRRFGSHDLFAKLAEPEYQALWREDASFLTRYAVYNSILRKLNHAKMFATFTLPDATAHPIWTRFDKLGGNLHQYTFLFNEFGERRHAIRFHKLLKVENGVAREVDDVTVPISMSEQLDNLLPRDPNEPIALYFRDYGAEQHFTGEFGGAKIQCRRDQLAHMHRRRGARDVYLNVSVRVQSQSEARGERRPPYAAVFRLVGDNHRAFVHFDKLSDYLAEHPDDGKLGSEGLLSGLRVMSVDLGLRTSASISVFRVARKDELKPNSKGRVPFFFPIKGNDNLVAVHERSQLLKLPGETESKDLRAIREERQRTLRQLRTQLAYLRLLVRCGSEDVGRRERSWAKLIEQPVDAANHMTPDWREAFENELQKLKSLHGICSDKEWMDAVYESVRRVWRHMGKQVRDWRKDVRSGERPKIRGYAKDVVGGNSIEQIEYLERQYKFLKSWSFFGKVSGQVIRAEKGSRFAITLREHIDHAKEDRLKKLADRIIMEALGYVYALDERGKGKWVAKYPPCQLILLEELSEYQFNNDRPPSENNQLMQWSHRGVFQELINQAQVHDLLVGTMYAAFSSRFDARTGAPGIRCRRVPARCTQEHNPEPFPWWLNKFVVEHTLDACPLRADDLIPTGEGEIFVSPFSAEEGDFHQIHADLNAAQNLQQRLWSDFDISQIRLRCDWGEVDGELVLIPRLTGKRTADSYSNKVFYTNTGVTYYERERGKKRRKVFAQEKLSEEEAELLVEADEAREKSVVLMRDPSGIINRGNWTRQKEFWSMVNQRIEGYLVKQIRSRVPLQDSACENTGDI* SEQMATRSFILKIEPNEEVKKGLWKTHEVLNHGIAYYMNILKLIRQEAIYEHHEQDPKNPKKVSKAEIQAELWDFVIDLKMQKCNSFTHEVDKDVVFNILRELYEELVPSSVEKKGEANQLSNKFLYPLVDPNSQSGKGTASSGRKPRWYNO:NLKIAGDPSWEEEKKKWEEDKKKDPLAKILGKLAEYGLIPLFIPFTDSNEPIVKEIKWMEKSRNQSVRRLDKD14MFIQALERFLSWESWNLKVKEEYEKVEKEHKTLEERIKEDIQAFKSLEQYEKERQEQLLRDTLNTNEYRLSKRGLRGWREIIQKWLKMDENEPSEKYLEVFKDYQRKHPREAGDYSVYEFLSKKENHFIWRNHPEYPYLYATFCEIDKKKKDAKQQATFTLADPINHPLWVRFEERSGSNLNKYRILTEQLHTEKLKKKLTVQLDRLIYPTESGGWEEKGKVDIVLLPSRQFYNQIFLDIEEKGKHAFTYKDESIKFPLKGTLGGARVQFDRDHLRRYPHKVESGNVGRIYFNMTVNIEPTESPVSKSLKIHRDDFPKFVNFKPKELTEWIKDSKGKKLKSGIESLEIGLRVMSIDLGQRQAAAASIFEVVDQKPDIEGKLFFPIKGTELYAVHRASFNIKLPGETLVKSREVLRKAREDNLKLMNQKLNFLRNVLHFQQFEDITEREKRVTKWISRQENSDVPLVYQDELIQIRELMYKPYKDWVAFLKQLHKRLEVEIGKEVKHWRKSLSDGRKGLYGISLKNIDEIDRTRKFLLRWSLRPTEPGEVRRLEPGQRFAIDQLNHLNALKEDRLKKMANTIIMHALGYCYDVRKKKWQAKNPACQIILFEDLSNYNPYEERSRFENSKLMKWSRREIPRQVALQGEIYGLQVGEVGAQFSSRFHAKTGSPGIRCSVVTKEKLQDNRFFKNLQREGRLTLDKIAVLKEGDLYPDKGGEKFISLSKDRKLVTTHADINAAQNLQKRFWTRTHGFYKVYCKAYQVDGQTVYIPESKDQKQKIIEEFGEGYFILKDGVYEWGNAGKLKIKKGSSKQSSSELVDSDILKDSFDLASELKGEKLMLYRDPSGNVFP
SDKWMAAGVFFGKLERILISKLTNQYSISTIEDDSSKQSM* SEQMPTRTINLKLVLGKNPENATLRRALFSTHRLVNQATKRIEEFLLLCRGEAYRTVDNEGKEAEIPRHAVQEEALIDAFAKAAQRHNGCISTYEDQEILDVLRQLYERLVPSVNENNEAGDAQAANAWVSPLMSAESEGGLSVYDKVLNO:DPPPVWMKLKEEKAPGWEAASQIWIQSDEGQSLLNKPGSPPRWIRKLRSGQPWQDDFVSDQKKKQDELTKG15NAPLIKQLKEMGLLPLVNPFFRHLLDPEGKGVSPWDRLAVRAAVAHFISWESWNHRTRAEYNSLKLRRDEFEAASDEFKDDFTLLRQYEAKRHSTLKSIALADDSNPYRIGVRSLRAWNRVREEWIDKGATEEQRVTILSKLQTQLRGKFGDPDLFNWLAQDRHVHLWSPRDSVTPLVRINAVDKVLRRRKPYALMTFAHPRFHPRWILYEAPGGSNLRQYALDCTENALHITLPLLVDDAHGTWIEKKIRVPLAPSGQIQDLTLEKLEKKKNRLYYRSGFQQFAGLAGGAEVLFHRPYMEHDERSEESLLERPGAVWFKLTLDVATQAPPNWLDGKGRVRTPPEVHHFKTALSNKSKHTRTLQPGLRVLSVDLGMRTFASCSVFELIEGKPETGRAFPVADERSMDSPNKLWAKHERSFKLTLPGETPSRKEEEERSIARAEIYALKRDIQRLKSLLRLGEEDNDNRRDALLEQFFKGWGEEDVVPGQAFPRSLFQGLGAAPFRSTPELWRQHCQTYYDKAEACLAKHISDWRKRTRPRPTSREMWYKTRSYHGGKSIWMLEYLDAVRKLLLSWSLRGRTYGAINRQDTARFGSLASRLLHHINSLKEDRIKTGADSIVQAARGYIPLPHGKGWEQRYEPCQLILFEDLARYRFRVDRPRRENSQLMQWNHRAIVAETTMQAELYGQIVENTAAGFSSRFHAATGAPGVRCRFLLERDFDNDLPKPYLLRELSWMLGNTKVESEEEKLRLLSEKIRPGSLVPWDGGEQFATLHPKRQTLCVIHADMNAAQNLQRRFFGRCGEAFRLVCQPHGDDVLRLASTPGARLLGALQQLENGQGAFELVRDMGSTSQMNRFVMKSLGKKKIKPLQDNNGDDELEDVLSVLPEEDDTGRITVFRDSSGIFFPCNVWIPAKQFWPAVRAMIWKVMASHSLG* SEQMTKLRHRQKKLTHDWAGSKKREVLGSNGKLQNPLLMPVKKGQVTEFRKAFSAYARATKGEMTDGRKNMF 
IDTHSFEPFKTKPSLHQCELADKAYQSLHSYLPGSLAHFLLSAHALGFRIFSKSGEATAFQASSKIEAYESKLASENO:LACVDLSIQNLTISTLFNALTTSVRGKGEETSADPLIARFYTLLTGKPLSRDTQGPERDLAEVISRKIASSFGTW16KEMTANPLQSLQFFEEELHALDANVSLSPAFDVLIKMNDLQGDLKNRTIVFDPDAPVFEYNAEDPADIIIKLTARYAKEAVIKNQNVGNYVKNAITTTNANGLGWLLNKGLSLLPVSTDDELLEFIGVERSHPSCHALIELIAQLEAPELFEKNVFSDTRSEVQGMIDSAVSNHIARLSSSRNSLSMDSEELERLIKSFQIHTPHCSLFIGAQSLSQQLESLPEALQSGVNSADILLGSTQYMLTNSLVEESIATYQRTLNRINYLSGVAGQINGAIKRKAIDGEKIHLPAAWSELISLPFIGQPVIDVESDLAHLKNQYQTLSNEFDTLISALQKNFDLNFNKALLNRTQHFEAMCRSTKKNALSKPEIVSYRDLLARLTSCLYRGSLVLRRAGIEVLKKHKIFESNSELREHVHERKHFVFVSPLDRKAKKLLRLTDSRPDLLHVIDEILQHDNLENKDRESLWLVRSGYLLAGLPDQLSSSFINLPIITQKGDRRLIDLIQYDQINRDAFVMLVTSAFKSNLSGLQYRANKQSFVVTRTLSPYLGSKLVYVPKDKDWLVPSQMFEGRFADILQSDYMVWKDAGRLCVIDTAKHLSNIKKSVFSSEEVLAFLRELPHRTFIQTEVRGLGVNVDGIAFNNGDIPSLKTFSNCVQVKVSRTNTSLVQTLNRWFEGGKVSPPSIQFERAYYKKDDQIHEDAAKRKIRFQMPATELVHASDDAGWTPSYLLGIDPGEYGMGLSLVSINNGEVLDSGFIHINSLINFASKKSNHQTKVVPRQQYKSPYANYLEQSKDSAAGDIAHILDRLIYKLNALPVFEALSGNSQSAADQVWTKVLSFYTWGDNDAQNSIRKQHWFGASHWDIKGMLRQPPTEKKPKPYIAFPGSQVSSYGNSQRCSCCGRNPIEQLREMAKDTSIKELKIRNSEIQLFDGTIKLFNPDPSTVIERRRHNLGPSRIPVADRTFKNISPSSLEFKELITIVSRSIRHSPEFIAKKRGIGSEYFCAYSDCNSSLNSEANAAANVAQKFQKQLFFEL* 
SEQMKRILNSLKVAALRLLFRGKGSELVKTVKYPLVSPVQGAVEELAEAIRHDNLHLFGQKEIVDLMEKDEGTQVIDYSVVDFWLDTLRLGMFFSPSANALKITLGKFNSDQVSPFRKVLEQSPFFLAGRLKVEPAERILSVEIRKIGKRENO:NRVENYAADVETCFIGQLSSDEKQSIQKLANDIWDSKDHEEQRMLKADFFAIPLIKDPKAVTEEDPENETAGK17QKPLELCVCLVPELYTRGFGSIADFLVQRLTLLRDKMSTDTAEDCLEYVGIEEEKGNGMNSLLGTFLKNLQGDGFEQIFQFMLGSYVGWQGKEDVLRERLDLLAEKVKRLPKPKFAGEWSGHRMFLHGQLKSWSSNFFRLFNETRELLESIKSDIQHATMLISYVEEKGGYHPQLLSQYRKLMEQLPALRTKVLDPEIEMTHMSEAVRSYIMIHKSVAGFLPDLLESLDRDKDREFLLSIFPRIPKIDKKTKEIVAWELPGEPEEGYLFTANNLFRNFLENPKHVPRFMAERIPEDWTRLRSAPVWFDGMVKQWQKVVNQLVESPGALYQFNESFLRQRLQAMLTVYKRDLQTEKFLKLLADVCRPLVDFFGLGGNDIIFKSCQDPRKQWQTVIPLSVPADVYTACEGLAIRLRETLGFEWKNLKGHEREDFLRLHQLLGNLLFWIRDAKLVVKLEDWMNNPCVQEYVEARKAIDLPLEIFGFEVPIFLNGYLFSELRQLELLLRRKSVMTSYSVKTTGSPNRLFQLVYLPLNPSDPEKKNSNNFQERLDTPTGLSRRFLDLTLDAFAGKLLTDPVTQELKTMAGFYDHLFGFKLPCKLAAMSNHPGSSSKMVVLAKPKKGVASNIGFEPIPDPAHPVFRVRSSWPELKYLEGLLYLPEDTPLTIELAETSVSCQSVSSVAFDLKNLTTILGRVGEFRVTADQPFKLTPIIPEKEESFIGKTYLGLDAGERSGVGFAIVTVDGDGYEVQRLGVHEDTQLMALQQVASKSLKEPVFQPLRKGTFRQQERIRKSLRGCYWNFYHALMIKYRAKVVHEESVGSSGLVGQWLRAFQKDLKKADVLPKKGGKNGVDKKKRESSAQDTLWGGAFSKKEEQQIAFEVQAAGSSQFCLKCGWWFQLGMREVNRVQESGVVLDWNRSIVTFLIESSGEKVYGFSPQQLEKGFRPDIETFKKMVRDFMRPPMFDRKGRPAAAYERFVLGRRHRRYRFDKVFEERFGRSALFICPRVGCGNFDHSSEQSAVVLALIGYIADKEGMSGKKLVYVRLAELMAEWKLKKLERSRVEEQSSAQ* SEQMAESKQMQCRKCGASMKYEVIGLGKKSCRYMCPDCGNHTSARKIQNKKKRDKKYGSASKAQSQRIAVAG 
IDALYPDKKVQTIKTYKYPADLNGEVHDSGVAEKIAQAIQEDEIGLLGPSSEYACWIASQKQSEPYSVVDFWFDNO:AVCAGGVFAYSGARLLSTVLQLSGEESVLRAALASSPFVDDINLAQAEKFLAVSRRTGQDKLGKRIGECFAE18GRLEALGIKDRMREFVQAIDVAQTAGQRFAAKLKIFGISQMPEAKQWNNDSGLTVCILPDYYVPEENRADQLVVLLRRLREIAYCMGIEDEAGFEHLGIDPGALSNFSNGNPKRGFLGRLLNNDIIALANNMSAMTPYWEGRKGELIERLAWLKHRAEGLYLKEPHFGNSWADHRSRIFSRIAGWLSGCAGKLKIAKDQISGVRTDLFLLKRLLDAVPQSAPSPDFIASISALDRFLEAAESSQDPAEQVRALYAFHLNAPAVRSIANKAVQRSDSQEWLIKELDAVDHLEFNKAFPFFSDTGKKKKKGANSNGAPSEEEYTETESIQQPEDAEQEVNGQEGNGASKNQKKFQRIPRFFGEGSRSEYRILTEAPQYFDMFCNNMRAIFMQLESQPRKAPRDFKCFLQNRLQKLYKQTFLNARSNKCRALLESVLISWGEFYTYGANEKKFRLRHEASERSSDPDYVVQQALEIARRLFLFGFEWRDCSAGERVDLVEIHKKAISFLLAITQAEVSVGSYNWLGNSTVSRYLSVAGTDTLYGTQLEEFLNATVLSQMRGLAIRLSSQELKDGFDVQLESSCQDNLQHLLVYRASRDLAACKRATCPAELDPKILVLPVGAFIASVMKMIERGDEPLAGAYLRHRPHSFGWQIRVRGVAEVGMDQGTALAFQKPTESEPFKIKPFSAQYGPVLWLNSSSYSQSQYLDGFLSQPKNWSMRVLPQAGSVRVEQRVALIWNLQAGKMRLERSGARAFFMPVPFSFRPSGSGDEAVLAPNRYLGLFPHSGGIEYAVVDVLDSAGFKILERGTIAVNGFSQKRGERQEEAHREKQRRGISDIGRKKPVQAEVDAANELHRKYTDVATRLGCRIVVQWAPQPKPGTAPTAQTVYARAVRTEAPRSGNQEDHARMKSSWGYTWGTYWEKRKPEDILGISTQVYWTGGIGESCPAVAVALLGHIRATSTQTEWEKEEVVFGRLKKFFPS* 
SEQMEKRINKIRKKLSADNATKPVSRSGPMKTLLVRVMTDDLKKRLEKRRKKPEVMPQVISNNAANNLRMLLDDIDYTKMKEAILQVYWQEFKDDHVGLMCKFAQPASKKIDQNKLKPEMDEKGNLTTAGFACSQCGQPLFVYKLENO:QVSEKGKAYTNYFGRCNVAEHEKLILLAQLKPEKDSDEAVTYSLGKFGQRALDFYSIHVTKESTHPVKPLAQ19IAGNRYASGPVGKALSDACMGTIASFLSKYQDIIIEHQKVVKGNQKRLESLRELAGKENLEYPSVTLPPQPHTKEGVDAYNEVIARVRMWVNLNLWQKLKLSRDDAKPLLRLKGFPSFPVVERRENEVDWWNTINEVKKLIDAKRDMGRVFWSGVTAEKRNTILEGYNYLPNENDHKKREGSLENPKKPAKRQFGDLLLYLEKKYAGDWGKVFDEAWERIDKKIAGLTSHIEREEARNAEDAQSKAVLTDWLRAKASFVLERLKEMDEKEFYACEIQLQKWYGDLRGNPFAVEAENRVVDISGFSIGSDGHSIQYRNLLAWKYLENGKREFYLLMNYGKKGRIRFTDGTDIKKSGKWQGLLYGGGKAKVIDLTFDPDDEQLIILPLAFGTRQGREFIWNDLLSLETGLIKLANGRVIEKTIYNKKIGRDEPALFVALTFERREVVDPSNIKPVNLIGVDRGENIPAVIALTDPEGCPLPEFKDSSGGPTDILRIGEGYKEKQRAIQAAKEVEQRRAGGYSRKFASKSRNLADDMVRNSARDLFYHAVTHDAVLVFENLSRGFGRQGKRTFMTERQYTKMEDWLTAKLAYEGLTSKTYLSKTLAQYTSKTCSNCGFTITTADYDGMLVRLKKTSDGWATTLNNKELKAEGQITYYNRYKRQTVEKELSAELDRLSEESGNNDISKWTKGRRDEALFLLKKRFSHRPVQEQFVCLDCGHEVHADEQAALNIARSWLFLNSNSTEFKSYKSGKQPFVGAWQAFYKRRLKEVWKPNA SEQMKRINKIRRRLVKDSNTKKAGKTGPMKTLLVRVMTPDLRERLENLRKKPENIPQPISNTSRANLNKLLTDYTEIDMKKAILHVYWEEFQKDPVGLMSRVAQPAPKNIDQRKLIPVKDGNERLTSSGFACSQCCQPLYVYKLEQVNDNO:KGKPHTNYFGRCNVSEHERLILLSPHKPEANDELVTYSLGKFGQRALDFYSIHVTRESNHPVKPLEQIGGNSC20ASGPVGKALSDACMGAVASFLTKYQDIILEHQKVIKKNEKRLANLKDIASANGLAFPKITLPPQPHTKEGIEAYNNVVAQIVIWVNLNLWQKLKIGRDEAKPLQRLKGFPSFPLVERQANEVDWWDMVCNVKKLINEKKEDGKVFWQNLAGYKRQEALLPYLSSEEDRKKGKKFARYQFGDLLLHLEKKHGEDWGKVYDEAWERIDKKVEGLSKHIKLEEERRSEDAQSKAALTDWLRAKASFVIEGLKEADKDEFCRCELKLQKWYGDLRGKPFAIEAENSILDISGFSKQYNCAFIWQKDGVKKLNLYLIINYFKGGKLRFKKIKPEAFEANRFYTVINKKSGEIVPMEVNFNFDDPNLIILPLAFGKRQGREFIWNDLLSLETGSLKLANGRVIEKTLYNRRTRQDEPALFVALTFERREVLDSSNIKPMNLIGIDRGENIPAVIALTDPEGCPLSRFKDSLGNPTHILRIGESYKEKQRTIQAAKEVEQRRAGGYSRKYASKAKNLADDMVRNTARDLLYYAVTQDAMLIFENLSRGFGRQGKRTFMAERQYTRMEDWLTAKLAYEGLPSKTYLSKTLAQYTSKTCSNCGFTITSADYDRVLEKLKKTATGWMTTINGKELKVEGQITYYNRYKRQNVVKDLSVELDRLSEESVNNDISSWTKGRSGEALSLLKKRFSHRPVQEKFVCLNCGFETHADEQAALNIARSWLFLRSQEYKKYQTNKTTGNTDKRAFVETWQSFYRKKLKEVWKP 
SEQatgGGAAAAATGTATTATCTTGGTCTGGATATAGGAACAAATTCTGTTGGATATGCCGTAACCGACCCATCID GTACCATTTGCTCAAATTTAAAGGCGAACCGATGTGGGGTGCCCACGTGTTTGCTGCGGGGAATCAATCNO:AGCTGAACGGAGAAGCTTTCGTACGAGCCGCAGACGCCTTGACCGCAGGCAACAGCGTGTCAAACTGGT 21TCAAGAAATCTTTGCTCCCGTGATTAGTCCCATTGATCCACGTTTTTTTATCAGACTTCATGAGAGCGCTTTATGGCGGGATGATGTGGCTGAAACGGATAAACATATTTTCTTTAATGACCCGACCTATACGGATAAGGAATATTATTCTGACTATCCAACCATCCATCATCTCATTGTGGACCTTATGGAAAGCAGTGAAAAGCATGACCCGCGGCTTGTTTATTTGGCTGTTGCCTGGCTGGTTGCTCATCGTGGTCATTTCCTCAATGAAGTGGATAAGGATAATATTGGGGATGTCCTGAGTTTTGACGCCTTTTATCCTGAGTTTCTGGCATTTCTTTCCGATAATGGGGTGTCACCTTGGGTATGTGAGTCAAAAGCACTCCAAGCGACCCTGCTTTCACGAAACTCCGTCAACGATAAGTATAAAGCCTTGAAGTCTCTGATCTTTGGCAGCCAAAAGCCGGAGGATAATTTTGATGCCAATATCAGTGAAGATGGACTTATCCAACTTTTAGCAGGAAAAAAGGTCAAGGTCAATAAACTTTTTCCTCAAGAAAGTAATGATGCTTCCTTTACACTCAATGATAAGGAAGATGCAATTGAGGAAATCTTAGGAACGCTTACACCGGATGAGTGTGAATGGATTGCGCATATTAGGAGGCTGTTTGATTGGGCCATCATGAAACATGCTCTCAAAGATGGCAGAACAATCTCCGAATCGAAAGTAAAGCTCTATGAACAGCATCACCATGACTTGACACAGCTCAAGTATTTTGTGAAGACCTATCTAGCAAAGGAATATGATGACATTTTTCGAAACGTAGATAGTGAAACAACCAAAAACTATGTCGCATATTCCTATCATGTAAAAGAAGTCAAGGGTACATTGCCCAAAAATAAGGCAACCCAAGAAGAATTTTGCAAGTATGTCCTTGGAAAGGTAAAGAACATCGAATGCAGTGAAGCTGATAAGGTTGATTTTGATGAAATGATTCAGCGTCTTACAGACAATTCCTTTATGCCGAAACAAGTATCAGGTGAAAACAGGGTTATCCCTTACCAGCTTTACTATTATGAACTAAAGACTATTTTGAATAAAGCCGCTTCTTATCTGCCTTTTTTGACCCAATGCGGAAAAGATGCCATCTCCAATCAAGATAAGCTCCTTTCCATCATGACCTTTCGGATTCCGTATTTCGTTGGGCCCTTGCGCAAGGACAATTCAGAGCATGCCTGGCTGGAACGAAAAGCAGGGAAAATCTATCCGTGGAATTTTAACGACAAAGTTGACCTTGATAAAAGTGAAGAAGCGTTCATTCGGAGAATGACGAATACCTGCACTTATTATCCCGGTGAAGATGTTTTGCCACTTGACTCCCTTATTTATGAAAAATTCATGATCCTCAATGAAATCAATAATATCCGAATTGATGGTTATCCTATTTCTGTAGATGTAAAACAGCAGGTTTTTGGCCTCTTTGAAAAGAAGAGAAGAGTGACCGTAAAGGATATCCAGAATCTCCTGCTTTCCTTGGGTGCCTTGGATAAGCATGGTAAATTGACGGGAATCGATACTACCATCCATAGCAATTACAATACATACCATCATTTTAAATCGCTCATGGAGCGTGGCGTTCTTACTCGTGATGATGTGGAACGCATTGTGGAGCGTATGACCTATAGTGATGATACAAAACGCGTCCGTCTTTGGCTGAACAATAATTATGGAACGCTCACTGCTGACGACGTAAAGCATATTTCAAGGCTCCGAAA
GCATGATTTTGGCCGGCTTTCCAAAATGTTCCTCACAGGCCTAAAGGGAGTTCATAAGGAAACGGGGGAACGAGCTTCCATTTTGGATTTTATGTGGAATACCAATGATAACTTGATGCAGCTTTTATCTGAATGTTATACTTTTTCGGATGAAATTACCAAGCTGCAGGAAGCATACTATGCCAAGGCGCAGCTTTCCCTGAATGATTTTCTGGACTCCATGTATATTTCAAATGCTGTCAAACGTCCTATCTATCGAACTCTTGCCGTTGTAAATGACATACGCAAAGCCTGTGGGACGGCGCCAAAACGCATTTTTATCGAAATGGCAAGAGATGGGGAAAGCAAAAAGAAAAGGAGCGTAACAAGAAGAGAACAAATCAAGAATCTTTATAGGTCCATCCGCAAGGATTTTCAGCAGGAGGTAGATTTCCTTGAAAAAATCCTTGAAAACAAAAGCGATGGACAGCTGCAAAGCGATGCGCTCTATCTATACTTTGCGCAGCTTGGAAGGGATATGTATACCGGGGACCCTATCAAGTTGGAGCATATCAAGGACCAGTCCTTCTATAATATTGATCATATCTATCCCCAAAGCATGGTCAAGGACGATAGTCTTGATAACAAGGTGTTGGTTCAATCGGAAATTAATGGAGAGAAGAGCAGTCGATATCCTCTTGATGCTGCTATCCGTAATAAAATGAAGCCTCTTTGGGATGCTTATTATAACCATGGCCTGATTTCCCTCAAGAAGTATCAGCGTTTGACGCGGAGCACTCCCTTTACAGATGATGAAAAGTGGGATTTCATCAATCGGCAGCTTGTTGAGACAAGACAATCCACGAAGGCCTTGGCAATCTTACTAAAAAGGAAGTTCCCTGATACGGAGATTGTCTACTCCAAGGCAGGGCTTTCTTCTGATTTTCGGCATGAGTTTGGTCTCGTAAAATCGAGGAATATCAATGACCTGCACCATGCAAAGGACGCATTTCTTGCGATTGTAACAGGAAATGTCTATCATGAACGCTTTAATCGCCGGTGGTTTATGGTGAACCAGCCCTATTCCGTCAAGACCAAGACGTTGTTTACGCATTCTATTAAAAATGGTAATTTTGTAGCTTGGAATGGAGAAGAGGATCTTGGCCGCATTGTTAAAATGTTAAAGCAAAATAAGAACACTATTCATTTCACGCGGTTCTCTTTTGATCGAAAGGAAGGCCTGTTTGATATTCAGCCACTAAAAGCGTCAACCGGTCTTGTACCAAGAAAAGCCGGACTAGACGTGGTAAAATATGGTGGCTATGACAAATCGACAGCAGCTTATTATCTCCTTGTTCGATTTACACTAGAAGATAAAAAGACTCAACATAAATTGATGATGATTCCTGTAGAAGGCTTGTATAAAGCTCGAATTGACCATGATAAGGAATTCTTAACGGACTATGCACAAACTACAATCAGTGAAATCCTACAAAAAGATAAACAAAAGGTGATAAATATAATGTTTCCAATGGGAACAAGGCACATTAAACTGAATTCCATGATTTCAATCGATGGTTTTTATCTTTCCATTGGAGGAAAGTCTAGTAAGGGAAAATCGGTGTTGTGTCATGCTATGGTACCTCTTATTGTACCTCATAAGATAGAATGTTATATTAAGGCGATGGAGTCTTTTGCACGTAAATTTAAAGAAAATAATAAATTAAGGATTGTGGAAAAGTTTGATAAGATTACGGTGGAAGATAACTTGAACCTATACGAACTATTTTTACAAAAACTTCAACATAACCCATATAATAAGTTCTTCTCCACACAATTTGATGTGCTGACTAATGGAAGAAGTACATTTACTAAATTATCTCCAGAGGAACAAGTTCAAACGTTATTGAATATCTTATCAATTTTTAAAACTTGTCGGAGCTCTGGCTGCGATTTAAAATCCATTAACGGTTCTGCTCAAGCTGCCAGAATTATGATCAGCGCAGATTTAACTGGACTCT
CAAAAAAATATTCCGATATTCGGCTTGTTGAGCAATCAGCATCTGGACTTTTTGTTAGTAAATCACAAAATCTTTTGGAGTATTTAtga SEQatgtcttcattaacaaaatttacaaataaatacagtaagcagctaaccataaaaaatgaactcatcccagtaggaaIDagactctcgagaacattaaggaaaacggtctcatagatggagatgaacagctaaacgagaattatcaaaaagcaaaNO:gataatcgttgatgattttctacgagatttcataaataaagctttaaataatacccaaataggaaattggagagaa22ttagcagatgctttaaataaagaagatgaagataacatagaaaagctccaagacaaaatcagaggaataattgtaagtaaattcgagacatttgatttgttttcttcttactcgataaagaaagacgaaaagataatagatgatgataatgatgttgaagaagaggagctagatctaggaaaaaaaacttcctcatttaaatatatttttaagaaaaacctttttaaattagtacttccttcttatttaaagacaacaaatcaggataaactgaaaataatctcttcttttgataatttttctacctatttcagaggattctttgagaacagaaaaaatattttcactaagaagcctatatctacgtcaattgcctacagaattgtccatgataactttccaaagtttctagataacatcagatgttttaatgtgtggcaaacagaatgcccacagttaattgtaaaggctgataattatttaaaatcaaagaacgtcatagctaaagataaatctttagcaaactattttactgtaggagcatatgattacttcttatcccagaatggcattgatttctacaacaacattatcggcggtctaccagcatttgctggtcatgagaaaatccaaggacttaatgaatttataaatcaagaatgccaaaaggacagcgaactaaaatctaaactgaaaaacagacatgctttcaaaatggctgttctatttaagcaaattctttcagatagagaaaaaagttttgttatagacgagttcgaatctgatgctcaggtcatagatgcggttaagaacttctatgcagaacaatgtaaggataataatgttatttttaaccttctaaatcttatcaagaatatagcgttcttatctgatgatgaattagatggaatttttatagaaggcaagtatttaagctctgtttcccaaaagctatattcagattggtcgaagcttcgaaatgatattgaagatagtgcaaacagtaaacaaggaaataaagagttagcaaagaaaattaaaacaaataaaggcgatgttgaaaaggccataagtaaatatgagttttctttatcagaacttaactcaattgtacatgataatacaaaattcagtgaccttctttcttgtacgttacataaagtggctagcgaaaaactagtgaaagttaatgaaggggactggccaaaacacctgaaaaataatgaagaaaaacaaaagataaaagagcctttagatgcattgttagaaatttataatacattgctgatattcaactgcaagtcatttaataagaacggtaatttctatgttgattatgacagatgcataaatgagctttctagtgttgtttatttatataacaaaacaagaaattactgtacaaagaaaccttataacacagacaaattcaaattaaactttaacagtcctcaattaggagagggctttagtaagtcgaaagaaaatgactgtctgacattattatttaaaaaagacgacaattactatgttggaattatcagaaaaggggcaaaaattaactttgatgatacacaagccattgcagacaatacagataactgtatatttaagatgaattatttcctattaaaagatgctaaaaagtttattcctaaatgttcaattcagt
taaaagaagtaaaagcacattttaaaaaatcagaggatgattatatcctgagtgacaaagaaaaatttgcctctccccttgttattaagaaatcaacatttttattagcaacagcacatgtaaaaggaaagaaaggaaacataaaaaaattccaaaaggaatattctaaggaaaatccaacagaatatagaaattctctgaatgaatggattgcattttgtaaagaatttctaaaaacatataaggcggcaacaatctttgacattacaacgttaaaaaaagctgaagaatatgctgatattgttgagttttataaggatgtagataatctttgttataaactagagttttgccctattaaaacatctttcattgagaatcttattgataatggggacttatatttattcagaatcaataataaagatttcagttcaaaatctactggtacaaagaatcttcatacgctctatcttcaggcaatctttgatgaaagaaacctcaataatcctactattatgttaaatggcggagcagagttattttatcgaaaagaaagcattgaacagaaaaataggataactcataaggcaggatcaattcttgtaaacaaggtttgtaaggatggaacaagtctagatgacaaaatcagaaacgaaatatatcaatatgaaaacaagtttattgatacattgtctgatgaagctaaaaaagttttacctaatgtaataaaaaaagaagcaactcacgacataacaaaaaaaaaggagaggtgtcctgtaaagataagcgatttacatcagataagttctttttccattgcccattaacaattaactataaggaaggagatacaaaacaatttaacaatgaggttttatctttccttagaggtaatccagacattaatatcatcggaattgacagaggagaaagaaaccttatatacgtaactgttattaatcagaaaggcgaaatacttgacagcgtttcgtttaacacagtaacaaacaagtcgagcaaaattgaacaaactgttgattatgaggaaaagcttgctgttagggaaaaagaaagaatagaagcaaaaagatcctgggattcaatatcaaagatagcaaccttaaaagaaggttatctatcagctattgttcatgagatatgcctactgatgatcaaacacaacgcaatcgttgtacttgagaatctaaatgcaggatttaagagaattagaggaggattatcagaaaagtctgtttatcagaaattcgagaagatgcttattaacaaactaaattactttgtatctaaaaaagaatcagactggaataaacctagtggacttttaaatggtttacaactttcagaccagttcgagtcatttgagaaattaggaattcaatctgggttcatcttctatgttcctgcagcatatacatctaagattgatcctacaacaggatttgcaaatgttcttaacttatccaaggtaagaaatgttgatgcaataaagagttttttcagtaatttcaatgaaatttcatatagcaaaaaagaagctctctttaaattctcttttgatttagattccttatcaaagaagggcttcagctcatttgtaaaattcagtaaatctaaatggaatgtatatacatttggagagagaataataaaaccaaagaataagcaagggtatcgtgaagataagagaattaatttaacatttgaaatgaaaaaacttctgaatgaatataaagtaagttttgatcttgaaaacaacttaattccaaatctaacctctgcaaatctgaaagataccttctggaaagaactattctttatttttaaaacaactctgcagcttagaaacagtgtaacaaatggcaaagaagatgtactgatttctccagtaaagaacgctaaaggagagttctttgtatcaggaactcataacaagacattacctcaagactgtgatgcaaatggagcatatcatatcg
ccctaaaaggtctgatgattcttgaacgtaacaatcttgttagagaagaaaaagacacaaagaagataatggcaatttctaatgttgactggtttgagtatgttcaaaaaaggagaggtgtcctgtaaSEQATGAACAACTATGATGAGTTTACCAAACTGTACCCAATACAGAAAACGATAAGGTTCGAATTGAAGCCG IDCAGGGAAGAACGATGGAACACCTCGAAACATTCAACTTTTTCGAAGAGGACAGGGATAGAGCGGAGAA NO:ATATAAGATTTTAAAGGAAGCAATCGACGAGTATCATAAGAAGTTTATAGACGAACATCTAACAAATAT 23GTCTCTTGACTGGAATTCTTTAAAACAGATTTCAGAGAAATACTATAAGAGTAGAGAGGAAAAAGACAAGAAAGTTTTTCTGTCAGAACAGAAACGCATGAGGCAAGAGATAGTTTCTGAGTTCAAAAAAGACGATCGGTTTAAAGATCTTTTTTCAAAAAAATTGTTTTCTGAACTTCTCAAGGAAGAGATTTACAAAAAAGGAAACCATCAGGAAATTGACGCATTGAAAAGTTTTGATAAATTCTCAGGCTATTTTATTGGGTTGCATGAGAACCGAAAAAATATGTATTCTGACGGAGACGAGATCACGGCTATCTCTAACCGTATTGTAAATGAGAATTTCCCGAAGTTCCTCGACAACCTTCAGAAATATCAGGAAGCTCGTAAAAAATATCCAGAGTGGATCATTAAGGCAGAATCTGCTTTAGTTGCACATAATATCAAGATGGATGAAGTCTTTTCCTTAGAGTATTTCAACAAAGTCCTGAATCAAGAAGGAATACAGAGATACAATCTCGCCCTAGGTGGCTATGTGACCAAAAGTGGTGAGAAAATGATGGGGCTTAATGATGCACTTAATCTTGCCCATCAAAGTGAAAAAAGCAGCAAGGGAAGGATACACATGACTCCACTCTTCAAACAGATTCTGAGTGAAAAAGAGTCCTTTTCTTATATACCAGATGTTTTTACAGAAGACTCTCAACTTTTACCATCCATTGGTGGGTTCTTTGCACAAATAGAAAATGATAAGGACGGGAATATTTTTGACAGAGCATTAGAATTGATATCTTCTTATGCAGAATACGATACAGAAAGGATATATATCAGGCAAGCGGACATAAACAGAGTTTCTAATGTTATTTTCGGGGAGTGGGGAACACTGGGGGGGTTAATGAGGGAATACAAAGCAGACTCTATCAACGACATCAATTTGGAGAGAACATGCAAGAAGGTAGACAAGTGGCTCGACTCAAAGGAGTTTGCGTTATCAGATGTATTAGAGGCAATAAAAAGAACCGGCAATAATGATGCTTTTAATGAATATATCTCAAAGATGCGCACTGCCAGGGAAAAGATTGACGCTGCAAGAAAGGAAATGAAATTCATTTCGGAAAAAATATCTGGAGACGAAGAATCGATCCATATTATCAAAACCTTATTGGACTCGGTGCAACAGTTTTTACATTTTTTCAATTTATTCAAAGCGCGTCAGGACATTCCTCTTGATGGAGCATTCTATGCGGAGTTCGATGAAGTCCATAGCAAACTGTTTGCTATTGTTCCGTTGTATAATAAGGTTAGGAACTATCTTACGAAAAATAACCTTAACACGAAAAAGATAAAGCTAAACTTCAAGAATCCAACTCTGGCAAACGGATGGGATCAAAACAAGGTATATGACTACGCCTCCTTAATCTTTCTCCGCGATGGTAATTATTATCTCGGAATAATAAATCCAAAAAGGAAAAAGAATATTAAATTCGAACAAGGGTCTGGAAATGGCCCATTCTACCGGAAGATGGTGTACAAACAAATTCCAGGGCCGAACAAGAACTTACCAAGAGTCTTCCTCACATCTACGAAAGGCAAAAAAGAGTACAAGCCGTCAAAGGAGATAATAGAAGGATATGAAGCGGACAAACACATAAG
AGGAGATAAATTCGATCTGGATTTCTGTCATAAGCTGATAGACTTCTTCAAGGAATCCATCGAGAAGCACAAGGACTGGAGTAAGTTCAACTTCTATTTCTCTCCAACTGAATCATATGGAGACATCAGCGAATTCTATCTGGATGTAGAAAAACAGGGATACCGGATGCATTTTGAGAATATTTCTGCCGAGACGATTGATGAGTATGTCGAAAAGGGGGACTTATTCCTCTTCCAGATATACAACAAAGACTTTGTGAAAGCGGCAACCGGAAAAAAAGATATGCACACCATTTATTGGAACGCGGCATTCTCGCCCGAGAACCTTCAGGATGTGGTAGTGAAACTGAACGGTGAAGCAGAACTTTTCTACAGAGACAAGAGCGACATCAAGGAGATAGTTCACAGGGAGGGAGAGATACTGGTCAATCGTACCTACAACGGCAGGACACCTGTGCCTGACAAGATCCACAAAAAATTAACAGATTATCATAATGGCCGTACCAAAGATCTCGGAGAAGCAAAAGAATACCTCGATAAGGTCAGATATTTCAAAGCGCACTACGACATCACAAAGGATCGCAGATACCTGAATGATAAAATATACTTCCATGTGCCTCTGACATTGAATTTCAAAGCAAACGGGAAGAAGAATCTCAATAAGATGGTAATTGAAAAGTTCCTCTCGGACGAAAAAGCGCATATTATTGGGATTGATCGCGGGGAAAGGAATCTTCTTTACTATTCTATCATTGACAGGTCAGGTAAAATAATCGATCAACAGAGCCTCAACGTCATCGATGGATTCGATTACCGAGAGAAACTGAATCAGAGGGAGATCGAGATGAAGGATGCCAGACAAAGCTGGAATGCTATCGGGAAGATAAAGGACCTCAAGGAAGGGTATCTTTCAAAAGCGGTCCACGAAATTACCAAGATGGCGATACAATACAATGCCATTGTTGTCATGGAGGAACTCAATTATGGGTTCAAACGCGGACGTTTCAAAGTTGAGAAGCAGATATATCAGAAATTCGAGAATATGCTGATTGACAAGATGAATTATCTGGTATTCAAGGATGCTCCGGATGAAAGTCCGGGAGGAGTCCTCAATGCATATCAGCTTACTAATCCGCTTGAAAGTTTCGCTAAACTTGGGAAACAGACAGGAATTCTTTTCTATGTTCCGGCAGCCTATACTTCGAAGATAGATCCGACGACCGGGTTTGTCAATCTTTTCAATACTTCAAGTAAAACGAACGCACAGGAAAGAAAAGAATTCTTGCAAAAATTCGAGTCGATCTCCTATTCCGCTAAAGACGGAGGAATATTCGCATTCGCGTTCGATTATCGGAAGTTCGGAACGTCAAAAACAGACCACAAAAATGTATGGACCGCATACACGAACGGGGAAAGGATGAGGTACATAAAAGAGAAAAAACGCAACGAACTGTTCGACCCCTCGAAGGAGATCAAAGAGGCTCTCACTTCATCAGGAATCAAATATGACGGCGGACAGAACATATTGCCAGATATCCTGAGGAGCAACAATAACGGTCTGATCTACACAATGTATTCCTCTTTCATAGCGGCCATTCAAATGAGGGTCTATGACGGGAAAGAAGACTATATCATCTCGCCGATAAAGAACAGCAAGGGAGAGTTCTTCAGGACCGATCCGAAAAGAAGGGAACTTCCGATAGACGCGGATGCGAACGGCGCGTATAACATTGCTCTCAGGGGCGAATTGACGATGCGTGCGATAGCGGAGAAGTTCGATCCGGACTCGGAAAAGATGGCGAAGCTAGAACTGAAACATAAGGACTGGTTCGAATTCATGCAGACAAGGGGGGATTGA SEQATGACAAAAACATTTGATTCAGAATTTTTTAATTTATATTCTCTTCAAAAAACAGTTCGTTTTGAACTCAAID 
GCCGGTTGGTGAAACAGCCTCGTTTGTTGAAGATTTTAAAAACGAAGGTTTGAAACGAGTTGTTTCAGANO:GGATGAACGGCGTGCGGTTGATTACCAAAAAGTGAAAGAAATTATTGATGACTACCACCGAGATTTTAT 24TGAAGAATCGCTGAACTATTTTCCTGAGCAGGTCTCAAAAGACGCTTTGGAACAAGCTTTTCACCTTTATCAAAAACTAAAAGCCGCTAAGGTTGAAGAGCGTGAAAAAGCATTGAAAGAATGGGAAGCCCTTCAGAAAAAACTGCGCGAAAAAGTTGTTAAATGTTTTTCAGATTCAAACAAAGCACGCTTTTCCCGCATTGATAAAAAAGAACTGATTAAAGAAGATTTAATTAACTGGTTGGTTGCACAAAATCGCGAAGATGACATTCCAACCGTTGAAACCTTTAACAACTTTACGACTTATTTTACGGGGTTTCATGAAAACCGAAAAAACATTTATTCAAAAGACGATCATGCCACAGCCATTTCATTTCGACTCATTCATGAAAACCTGCCTAAGTTTTTTGATAATGTGATCAGCTTTAATAAATTGAAGGAAGGATTTCCAGAGCTGAAATTTGATAAGGTTAAGGAAGATTTAGAAGTTGATTATGACTTGAAACATGCCTTTGAAATCGAATACTTTGTCAATTTTGTTACCCAAGCCGGAATTGACCAATATAACTATCTTTTGGGGGGTAAAACCTTAGAAGACGGCACCAAAAAGCAAGGCATGAATGAACAAATCAATCTGTTCAAGCAACAGCAAACCCGAGACAAAGCCCGACAAATTCCCAAACTCATACCATTGTTTAAACAAATTCTAAGCGAACGAACGGAAAGCCAATCGTTTATTCCAAAACAATTTGAATCAGACCAAGAGCTATTTGACTCACTGCAAAAACTGCATAACAACTGCCAAGATAAATTTACCGTACTGCAACAAGCCATTTTAGGCTTAGCCGAAGCAGATCTGAAAAAAGTATTCATTAAAACATCTGATCTTAATGCGCTATCAAATACCATTTTTGGAAATTACAGTGTGTTTTCGGATGCGTTGAATTTATACAAAGAATCGCTCAAAACAAAAAAGGCGCAAGAAGCGTTTGAAAAACTACCCGCTCACAGCATTCATGACTTGATTCAATATTTGGAGCAATTTAATAGCTCTTTGGATGCAGAAAAACAGCAATCAACTGACACCGTACTGAATTACTTTATTAAAACAGACGAGCTGTATTCTCGGTTCATAAAATCAACGAGCGAAGCCTTCACACAAGTACAACCACTCTTTGAATTGGAAGCATTAAGCTCAAAACGTCGTCCACCGGAAAGTGAAGACGAAGGCGCAAAAGGTCAGGAAGGGTTTGAGCAAATTAAACGCATAAAAGCCTATTTGGATACCTTGATGGAGGCGGTGCATTTTGCAAAACCACTTTATCTGGTGAAGGGGCGCAAAATGATTGAAGGTCTGGACAAAGACCAAAGTTTCTATGAAGCCTTTGAAATGGCTTACCAAGAACTAGAAAGTCTGATTATTCCAATCTACAACAAAGCTCGTAGTTATTTAAGTCGTAAACCGTTTAAAGCGGACAAATTCAAAATTAATTTTGATAATAATACATTGCTTTCCGGTTGGGATGCTAATAAAGAAACGGCTAACGCTTCAATTTTGTTTAAGAAGGATGGTTTGTATTATTTAGGAATCATGCCTAAAGGAAAAACGTTTTTGTTCGATTACTTCGTTTCATCGGAAGATTCTGAAAAGTTAAAACAAAGAAGACAAAAAACCGCCGAAGAAGCGCTTGCGCAAGATGGCGAAAGCTACTTTGAAAAAATTCGTTACAAGCTGTTACCTGGCGCCAGCAAAATGTTGCCGAAAGTATTTTTTTCCAACAAAAACATAGGGTTTTACAACCCAAGTGATGACATACTTCGTATCAGGAATACAGCCTCTCACACTAAAAACGGAA
CACCGCAAAAAGGGCACTCTAAAGTAGAGTTTAATTTGAATGATTGTCATAAGATGATTGATTTCTTTAAATCAAGCATTCAAAAGCATCCAGAGTGGGGAAGTTTTGGATTCACCTTTTCAGATACATCAGATTTTGAAGATATGAGCGCCTTTTATCGAGAAGTCGAAAACCAAGGTTATGTCATTAGTTTCGATAAAATAAAAGAAACTTACATTCAGAGTCAAGTTGAACAGGGGAACCTATATTTATTCCAAATCTACAATAAAGACTTCTCGCCCTACAGCAAAGGCAAACCAAATTTACACACGCTTTACTGGAAAGCGTTGTTTGAGGAAGCCAACCTAAATAATGTGGTGGCAAAACTCAATGGTGAAGCTGAAATTTTCTTTAGGCGACACTCAATCAAAGCATCTGATAAAGTGGTGCACCCAGCGAATCAAGCCATTGACAATAAAAACCCGCATACCGAAAAAACGCAAAGCACCTTTGAATATGATCTTGTAAAAGACAAGCGCTATACCCAAGACAAATTCTTCTTCCATGTACCGATTTCATTGAACTTTAAGGCACAAGGTGTTTCAAAATTTAACGATAAAGTGAATGGATTTTTAAAGGGTAACCCAGATGTCAATATTATTGGCATTGACCGAGGCGAACGACACCTTCTGTATTTCACTGTGGTGAATCAGAAAGGTGAAATTTTGGTTCAAGAGTCGCTTAATACCCTAATGAGTGATAAAGGGCATGTGAATGACTACCAGCAAAAACTCGACAAAAAAGAACAAGAACGCGATGCCGCTCGCAAAAGCTGGACGACGGTTGAAAATATCAAAGAATTAAAAGAAGGCTATTTATCTCATGTTGTTCATAAGTTGGCACACCTGATTATTAAATACAATGCCATTGTTTGCTTGGAAGACCTGAATTTTGGTTTCAAACGCGGGCGTTTTAAAGTGGAAAAACAAGTTTATCAGAAATTTGAAAAAGCGCTTATTGATAAGCTTAACTACTTGGTATTTAAAGAAAAAGAGTTAGGCGAGGTGGGCCATTATCTAACCGCCTATCAGTTGACCGCACCGTTTGAAAGTTTCAAGAAGTTAGGCAAGCAAAGTGGCATATTGTTTTATGTTCCGGCGGATTACACCTCCAAAATTGACCCAACCACCGGGTTTGTCAACTTTCTTGATCTGCGTTATCAGAGTGTCGAAAAAGCCAAACAGCTCTTAAGCGACTTTAATGCCATTCGTTTTAATTCAGTACAAAACTATTTTGAGTTCGAAATAGATTACAAAAAACTCACACCCAAACGTAAAGTTGGTACTCAGAGTAAATGGGTGATTTGTACCTATGGAGATGTCCGCTATCAAAATCGGCGTAATCAAAAAGGTCACTGGGAAACGGAAGAAGTCAATGTGACTGAAAAACTAAAAGCCCTTTTCGCCAGTGATTCCAAAACTACAACCGTAATCGATTACGCCAATGACGACAACCTAATTGACGTCATTCTGGAACAGGACAAAGCCAGCTTCTTCAAAGAACTGTTATGGTTATTAAAACTCACCATGACGCTCCGCCACAGCAAAATCAAAAGTGAAGACGACTTTATTCTTTCACCCGTTAAAAACGAACAAGGCGAGTTTTACGATAGTCGAAAAGCGGGCGAGGTGTGGCCTAAAGATGCAGACGCCAATGGCGCTTATCACATAGCGTTGAAAGGCTTGTGGAATCTGCAACAGATCAATCAGTGGGAAAAGGGTAAAACACTTAATCTGGCGATTAAAAACCAGGATTGGTTCAGTTTTATTCAAGAAAAGCCCTATCAAGAATAA SEQATGCACACAGGCGGATTACTTAGCATGGATGCCAAGGAGTTTACCGGACAGTACCCCCTTTCGAAGACT 
IDCTGCGTTTTGAACTGAGACCGATAGGCAGAACGTGGGACAATCTCGAAGCATCGGGGTATCTTGCGGAGNO:GACAGACACCGTGCAGAATGCTATCCCAGGGCAAAAGAGCTCTTGGACGACAACCATCGTGCATTCCTC 25AACCGTGTCCTGCCTCAGATCGATATGGATTGGCACCCGATCGCAGAGGCATTCTGCAAAGTCCACAAGAATCCGGGAAACAAGGAATTGGCTCAGGATTACAATCTTCAGCTGTCCAAACGCAGAAAGGAGATTTCGGCCTATCTGCAGGATGCGGACGGCTATAAAGGTCTGTTTGCCAAACCTGCATTGGATGAAGCAATGAAGATCGCGAAAGAAAACGGAAATGAATCGGACATAGAGGTTCTTGAGGCATTCAACGGTTTCTCCGTATACTTCACCGGATATCATGAGAGCAGGGAGAACATCTATTCGGACGAGGATATGGTGTCGGTAGCTTATCGCATCACCGAAGACAATTTCCCGAGATTCGTTTCCAATGCGCTTATATTCGATAAGCTGAATGAGTCGCACCCCGATATAATCTCGGAAGTATCCGGAAATCTGGGCGTAGACGACATCGGAAAATATTTTGATGTGTCTAACTACAATAATTTCCTGTCGCAGGCCGGTATAGATGACTACAATCACATCATCGGCGGCCATACGACGGAGGACGGTCTGATCCAGGCATTCAATGTTGTTCTGAATCTCAGGCATCAGAAAGACCCCGGATTCGAAAAAATCCAATTCAAACAGCTGTACAAACAGATACTCAGCGTCCGTACATCCAAATCCTATATCCCGAAACAGTTCGATAATTCGAAGGAGATGGTGGACTGCATCTGCGACTATGTGTCCAAGATCGAAAAATCCGAAACGGTCGAGAGAGCATTGAAGCTGGTAAGGAACATATCTTCTTTTGATTTGCGCGGAATATTCGTAAACAAGAAGAATCTCCGCATTCTTTCCAACAAACTGATTGGTGATTGGGACGCGATCGAAACCGCGCTGATGCACTCCTCCTCTTCGGAAAATGATAAGAAATCCGTCTACGACAGCGCCGAGGCATTTACGCTGGATGATATCTTTTCGTCCGTTAAAAAATTCTCAGATGCATCTGCAGAGGATATCGGAAACCGGGCGGAGGACATATGCAGAGTCATATCTGAGACCGCTCCGTTCATAAACGATCTGAGGGCTGTCGATTTGGACAGTTTGAATGACGACGGTTACGAGGCGGCGGTTTCCAAGATAAGGGAATCTCTGGAACCATATATGGATCTGTTTCATGAACTGGAGATATTCTCCGTAGGCGATGAATTCCCGAAATGTGCAGCTTTCTACAGTGAACTTGAAGAAGTCTCCGAACAGCTAATCGAGATTATACCGTTATTCAACAAGGCCCGTTCGTTCTGTACGCGCAAGAGATACAGTACGGACAAGATAAAGGTCAATTTGAAATTCCCGACACTCGCCGACGGATGGGATCTCAACAAAGAACGCGACAACAAAGCCGCAATACTCAGGAAAGACGGAAAGTACTACCTGGCCATACTGGATATGAAGAAAGATCTTTCTTCGATCAGAACTTCGGATGAAGACGAATCCAGTTTTGAGAAAATGGAGTACAAGCTTCTTCCGAGTCCGGTAAAGATGCTGCCAAAGATCTTCGTAAAATCGAAGGCGGCCAAGGAGAAGTACGGTCTGACCGACCGTATGCTGGAGTGCTACGATAAAGGGATGCACAAGAGCGGCAGTGCATTCGATCTCGGATTTTGTCACGAATTGATCGATTACTACAAGAGGTGCATCGCAGAATATCCCGGCTGGGACGTCTTCGATTTCAAGTTCAGGGAAACATCGGATTATGGCAGCATGAAGGAGTTCAATGAGGATGTTGCAGGGGCCGGATACTATATGTCCCTCAGAAAGATCCCTTGTTCGGAGGTCTACAGGCTTCTTGATGAGAAA
TCGATATATCTTTTCCAGATCTACAACAAAGATTATTCGGAAAACGCTCATGGGAATAAGAACATGCATACCATGTATTGGGAAGGGCTCTTTTCCCCCCAGAATCTGGAATCCCCTGTGTTTAAACTCAGCGGCGGTGCGGAGCTTTTCTTCCGTAAATCCTCCATACCCAATGACGCCAAAACGGTCCATCCGAAGGGAAGCGTCCTGGTTCCGCGCAATGATGTAAACGGCCGCAGGATACCTGACAGCATATATCGGGAGCTCACCAGATATTTCAACCGCGGAGATTGCCGCATAAGCGACGAGGCAAAGAGTTATCTGGACAAGGTGAAAACCAAGAAAGCTGACCACGATATCGTGAAAGACAGGAGGTTCACGGTGGACAAGATGATGTTCCACGTCCCTATCGCCATGAATTTCAAAGCGATTTCGAAGCCGAATCTCAATAAAAAGGTGATTGACGGCATAATCGACGACCAAGATCTGAAGATCATCGGCATAGACCGCGGAGAGCGCAACCTCATCTACGTAACCATGGTGGATCGCAAAGGGAACATCCTCTATCAGGATAGCCTCAATATTCTGAACGGATACGATTACCGTAAGGCCCTCGACGTCCGCGAATATGACAATAAAGAGGCTCGGAGGAACTGGACGAAGGTCGAAGGCATCCGTAAGATGAAAGAGGGGTATCTGTCGCTTGCAGTCAGCAAATTGGCAGATATGATCATAGAGAACAATGCGATTATCGTCATGGAGGATCTCAATCACGGATTCAAGGCAGGGCGTTCGAAGATAGAGAAACAGGTCTATCAGAAGTTCGAATCCATGCTCATAAACAAACTCGGTTACATGGTCCTCAAGGATAAGTCTATCGATCAGAGCGGCGGAGCTCTCCACGGATACCAGCTTGCCAACCATGTGACAACATTGGCATCTGTAGGTAAACAATGTGGAGTGATATTCTACATCCCTGCTGCATTTACATCCAAGATAGATCCGACAACAGGATTTGCAGATCTGTTCGCCCTCAGCAATGTTAAAAACGTGGCATCTATGAGAGAATTTTTCTCCAAGATGAAGTCTGTAATCTATGATAAGGCGGAGGGAAAATTCGCATTTACCTTCGACTATCTTGATTATAATGTGAAATCCGAGTGCGGAAGGACCCTTTGGACCGTGTATACGGTCGGAGAGAGATTCACATACAGCAGGGTCAATAGAGAATATGTCAGAAAAGTTCCGACAGACATAATCTACGACGCATTGCAAAAGGCAGGAATATCTGTTGAAGGGGATCTCAGGGACAGGATTGCTGAATCGGATGGCGACACTCTGAAGAGCATATTCTATGCATTCAAGTATGCATTGGATATGAGAGTAGAGAACCGCGAAGAGGATTACATACAGTCTCCTGTCAAAAATGCCTCCGGAGAATTCTTCTGTTCCAAGAACGCAGGCAAATCGCTCCCTCAGGATTCCGATGCGAACGGTGCATACAATATCGCACTCAAGGGGATCCTGCAGCTACGTATGCTTTCCGAGCAGTATGATCCGAATGCAGAGAGCATACGGTTGCCACTGATAACCAACAAGGCCTGGCTGACCTTTATGCAGTCCGGTATGAAGACATGGAAGAACTGA SEQatgGATAGTTTGAAAGATTTCACCAATCTGTACCCTGTCAGTAAGACATTGAGATTTGAATTAAAGCCCGTID TGGAAAGACTTTAGAAAATATCGAGAAAGCAGGTATTTTGAAAGAGGATGAGCATCGTGCAGAAAGTTNO:ATCGGAGGGTGAAGAAAATAATTGATACTTATCATAAGGTATTTATCGATTCTTCTCTTGAAAATATGGC26 
TAAAATGGGTATTGAGAATGAAATAAAAGCAATGCTCCAAAGTTTCTGCGAATTGTATAAAAAAGATCATCGCACTGAGGGTGAAGACAAGGCATTAGATAAAATTCGAGCAGTACTTCGTGGCCTGATTGTTGGGGCTTTCACTGGTGTTTGCGGAAGACGGGAAAATACAGTCCAAAACGAGAAGTACGAGAGTTTGTTCAAAGAAAAGTTGATAAAAGAAATTTTACCTGATTTTGTGCTCTCTACTGAGGCTGAAAGCTTGCCTTTCTCTGTTGAAGAAGCTACGAGGTCACTGAAGGAGTTTGATAGCTTTACATCCTACTTTGCTGGTTTTTACGAGAATAGAAAGAATATATACTCGACGAAACCTCAATCCACTGCCATTGCTTATCGTCTTATTCATGAGAACTTGCCGAAGTTCATTGATAATATTCTTGTTTTTCAGAAGATCAAAGAGCCTATAGCCAAAGAGCTGGAACATATTCGTGCGGACTTTTCTGCCGGGGGGTACATAAAAAAGGATGAGAGATTGGAGGATATTTTTTCGTTGAACTATTATATCCACGTGTTATCTCAGGCTGGGATCGAAAAATATAACGCATTGATTGGGAAGATTGTGACAGAAGGAGATGGAGAGATGAAAGGGCTCAATGAACACATCAACCTTTACAACCAACAAAGAGGCAGAGAGGATCGGCTCCCTCTTTTTAGGCCTCTTTATAAACAGATATTGAGTGACAGAGAGCAATTATCATACTTGCCTGAGAGTTTTGAAAAAGATGAGGAGCTCCTCAGGGCTCTAAAAGAGTTCTATGATCATATCGCAGAAGACATTCTCGGACGTACTCAACAGTTGATGACTTCTATTTCAGAATATGATTTATCTCGGATATACGTAAGGAACGATAGCCAATTGACTGATATATCAAAAAAAATGTTGGGAGATTGGAATGCTATCTACATGGCTAGAGAACGAGCATATGACCACGAGCAGGCTCCCAAAAGAATCACGGCGAAATACGAGAGGGACAGGATTAAAGCTCTTAAAGGAGAAGAGAGTATAAGTCTGGCAAATCTTAATAGTTGTATTGCCTTTCTGGACAATGTTAGAGATTGCCGTGTAGATACTTATCTTTCCACACTGGGCCAGAAGGAAGGACCACATGGTCTATCTAATCTCGTTGAGAACGTTTTTGCCTCATACCATGAAGCAGAGCAATTGTTGAGCTTTCCATACCCCGAAGAGAATAATCTGATTCAGGACAAGGACAATGTGGTGTTAATTAAGAATCTTCTCGACAATATCAGTGATCTGCAGAGGTTCTTGAAACCTCTTTGGGGTATGGGAGACGAACCCGATAAAGATGAAAGATTTTATGGAGAGTATAATTATATCCGAGGAGCTCTAGATCAGGTGATCCCTCTGTACAATAAGGTAAGGAACTACCTCACTCGGAAGCCTTATTCGACCAGAAAAGTAAAACTCAATTTTGGGAATTCTCAATTGCTTAGTGGTTGGGATAGAAATAAGGAAAAGGATAATAGCTGTGTGATTTTGCGTAAGGGGCAGAACTTCTATTTGGCTATTATGAACAATAGGCACAAAAGAAGTTTCGAAAACAAGGTGTTGCCCGAGTATAAGGAGGGAGAACCTTACTTCGAAAAGATGGATTATAAATTTTTGCCTGATCCTAATAAAATGCTTCCTAAGGTTTTTCTTTCGAAAAAAGGAATAGAGATATACAAACCAAGTCCGAAGCTTTTAGAACAATATGGACATGGAACTCACAAAAAGGGAGATACCTTTAGTATGGATGATTTGCACGAACTGATCGATTTCTTCAAACACTCAATCGAGGCTCATGAAGATTGGAAGCAATTCGGATTCAAATTTTCTGATACGGCTACTTATGAGAATGTATCTAGTTTCTATAGAGAAGTTGAGGATCAGGGGTATAAGCTCTCTTTCCGAAAAGTTTCGGAATCTTATGTCTATT
CATTAATAGATCAAGGCAAGTTGTATTTATTTCAGATATACAACAAGGACTTTTCTCCCTGCAGCAAAGGGACACCTAATCTGCATACCTTGTATTGGAGAATGCTTTTTGACGAGCGCAATTTGGCAGATGTCATATACAAACTGGATGGGAAGGCTGAAATCTTTTTCCGAGAGAAGAGTTTGAAAAATGATCATCCCACGCATCCGGCTGGTAAGCCTATCAAAAAGAAAAGTCGACAAAAAAAAGGAGAGGAGAGTCTGTTTGAGTATGATTTAGTCAAGGATAGGCACTATACGATGGATAAGTTCCAGTTTCATGTGCCTATTACTATGAATTTTAAATGTTCTGCAGGAAGCAAAGTCAATGATATGGTTAATGCTCATATTCGAGAGGCAAAGGATATGCATGTCATTGGAATTGATCGTGGAGAACGCAATCTGCTGTATATATGCGTGATAGATAGTCGAGGGACGATTTTGGATCAAATTTCTCTGAATACGATTAACGATATAGACTATCATGATTTATTGGAGAGTCGAGACAAAGACCGTCAGCAGGAGCGCCGAAACTGGCAAACTATCGAAGGGATCAAGGAGCTAAAACAAGGCTACCTTAGTCAGGCGGTTCATCGGATAGCCGAACTGATGGTGGCTTATAAGGCTGTAGTTGCTTTGGAGGATTTGAATATGGGGTTCAAACGTGGGCGGCAGAAAGTAGAAAGTTCTGTTTATCAGCAGTTTGAGAAACAGCTGATAGATAAGCTCAACTATCTTGTGGACAAGAAGAAAAGGCCTGAAGATATTGGAGGATTGTTGAGAGCCTATCAATTTACGGCCCCATTTAAGAGTTTTAAGGAAATGGGAAAGCAAAACGGCTTCTTGTTTTATATCCCGGCTTGGAACACGAGCAACATAGATCCGACTACTGGATTTGTTAATTTATTTCATGCCCAGTATGAAAATGTAGATAAAGCGAAGAGCTTCTTTCAAAAGTTTGATTCAATTAGTTACAACCCGAAGAAAGACTGGTTTGAGTTTGCATTCGATTATAAAAACTTTACTAAAAAGGCTGAAGGAAGTCGTTCTATGTGGATATTATGCACACATGGTTCCCGAATAAAGAATTTTAGAAATTCCCAGAAGAATGGTCAATGGGATTCCGAAGAATTCGCCTTGACGGAGGCTTTTAAGTCTCTTTTTGTGCGATATGAGATAGATTATACCGCTGATTTGAAAACAGCTATTGTGGACGAAAAGCAAAAAGACTTCTTCGTGGATCTTCTGAAGCTATTCAAATTGACAGTACAGATGCGCAACAGCTGGAAAGAGAAGGATTTGGATTATCTAATCTCTCCTGTAGCAGGGGCTGATGGCCGTTTCTTCGATACAAGAGAGGGAAATAAAAGTCTGCCTAAGGATGCAGATGCCAATGGAGCTTATAATATTGCCCTAAAAGGACTTTGGGCTCTACGCCAGATTCGGCAAACTTCAGAAGGCGGTAAACTCAAATTGGCGATTTCCAATAAGGAATGGCTACAGTTTGTGCAAGAGAGATCTTACGAGAAAGACtga 
SEQ ID NO: 27
atgaataatggaacaaataactttcagaattttatcggaatttcttctttgcagaagactcttaggaatgctctcattccaaccgaaacaacacagcaatttattgttaaaaacggaataattaaagaagatgagctaagaggagaaaatcgtcagatacttaaagatatcatggatgattattacagaggtttcatttcagaaactttatcgtcaattgatgatattgactggacttctttatttgagaaaatggaaattcagttaaaaaatggagataacaaagacactcttataaaagaacagactgaataccgtaaggcaattcataaaaaatttgcaaatgatgatagatttaaaaatatgttcagtgcaaaattaatctcagatattcttcctgaatttgtcattcataacaataattattctgcatcagaaaaggaagaaaaaacacaggtaattaaattattttccagatttgcaacgtcattcaaggactattttaaaaacagggctaattgtttttcggctgatgatatatcttcatcttcttgtcatagaatagttaatgataatgcagagatattttttagtaatgcattggtgtataggagaattgtaaaaagtctttcaaatgatgatataaataaaatatccggagatatgaaggattcattaaaggaaatgtctctggaagaaatttattcttatgaaaaatatggggaatttattacacaggaaggtatatctttttataatgatatatgtggtaaagtaaattcatttatgaatttatattgccagaaaaataaagaaaacaaaaatctctataagctgcaaaagcttcataaacagatactgtgcatagcagatacttcttatgaggtgccgtataaatttgaatcagatgaagaggtttatcaatcagtgaatggatttttggacaatattagttcgaaacatatcgttgaaagattgcgtaagattggagacaactataacggctacaatcttgataagatttatattgttagtaaattctatgaatcagtttcacaaaagacatatagagattgggaaacaataaatactgcattagaaattcattacaacaatatattacccggaaatggtaaatctaaagctgacaaggtaaaaaaagcggtaaagaatgatctgcaaaaaagcattactgaaatcaatgagcttgttagcaattataaattatgttcggatgataatattaaagctgagacatatatacatgaaatatcacatattttgaataattttgaagcacaggagcttaagtataatcctgaaattcatctggtggaaagtgaattgaaagcatctgaattaaaaaatgttctcgatgtaataatgaatgcttttcattggtgttcggttttcatgacagaggagctggtagataaagataataatttttatgccgagttagaagagatatatgacgaaatatatccggtaatttcattgtataatcttgtgcgtaattatgtaacgcagaagccatatagtacaaaaaaaattaaattgaattttggtattcctacactagcggatggatggagtaaaagtaaagaatatagtaataatgcaattattctcatgcgtgataatttgtactatttaggaatatttaatgcaaaaaataagcctgacaaaaagataattgaaggtaatacatcagaaaataaaggggattataagaagatgatttataatcttctgccaggaccaaataaaatgatccccaaggtattcctctcttcaaaaaccggagtggaaacatataagccgtctgcctatatattggagggctataaacaaaacaagcatattaaatcctctaaggattttgatataacattttgtcacgatttgattgattattttaagaactgtatagcaatacatcctgaatggaagaattttggctttgatttttctgacacctccacatatg
aagatatcagcggattttacagagaagtcgaattacaaggttataaaatcgactggacatatatcagcgaaaaggatattgatttgttgcaggaaaaaggacagttatatttattccaaatatataacaaagatttttccaagaaaagtaccggaaatgataatcttcatactatgtatttgaagaatttgtttagtgaagagaatttaaaggatattgtactgaaattaaacggtgaggcggaaatcttctttagaaaatcaagcataaagaatccaataattcataaaaaaggctctattcttgttaatagaacatatgaagcagaggaaaaagatcaatttggaaatatccagatagtcagaaaaaacataccggaaaatatatatcaggagctttataaatatttcaatgataaaagtgataaagaactttcggatgaagcagctaagcttaagaatgtagtaggtcatcatgaggctgctacaaacatagtaaaagattatagatatacatatgataaatattttcttcatatgcctattacaatcaattttaaagccaataagacaggctttattaatgacagaatattacaatatattgctaaagaaaaggatttgcatgtaataggcattgatcgtggtgaaagaaacctgatatatgtttcagtaattgatacttgtggaaatattgttgaacaaaaatcgtttaacattgttaatggatatgattatcagattaagctcaagcagcaggagggggcgcgacaaatcgcacgaaaagaatggaaagaaatcggcaaaataaaagaaattaaagaaggctatttatctcttgtaattcatgaaatttcaaagatggttattaaatataatgccataattgcaatggaggatttaagctacggatttaaaaaaggtcgtttcaaggttgagcgacaggtttaccagaagtttgagacaatgcttatcaacaaactcaactatctggtatttaaagatatatccataacggaaaacggtggtcttctaaagggataccagcttacatatattccagataaactgaaaaatgtgggtcatcaatgtggctgtatattttatgtacctgctgcctatacatcaaaaatagatcctacaaccggatttgtaaatatattcaaatttaaagatttaacagttgatgcgaagagagaatttataaaaaaatttgacagtatcagatatgattcagaaaaaaatctgttttgttttacattcgattataataactttattacgcaaaatactgttatgtcaaagtcaagctggagtgtatatacgtacggagttaggataaaaagaagatttgtcaatggcaggttctcaaatgaatcggatacaattgatataacaaaagatatggaaaaaacactcgaaatgacagatataaattggagagatggtcatgatctgaggcaggatattattgattatgaaatcgtacaacacatatttgagatttttagattgactgtacaaatgagaaacagtttaagtgaattagaagacagggattatgaccgtttgatttctccggtgctcaatgaaaataatatattttatgattcagctaaagcaggagatgcgttacctaaagacgcagatgctaatggtgcatattgtatagctctaaaaggcttgtatgaaatcaaacaaattacagagaattggaaagaagacggtaagttttcaagagataaacttaaaatttccaataaggactggtttgactttattcaaaataaaaggtatttataa 
SEQ ID NO: 28
atgacaaacaaatttacaaaccagtactcgctttccaaaacacttcgatttgagttgattccacaaggaaaaacattggaatttattcaagaaaaaggattgctctctcaagataaacaacgagcggagagttatcaagaaatgaaaaaaactattgataaatttcataaatactttatcgatttagctttaagcaatgctaaactaactcatttagaaacttacttggaattatacaataaaagtgctgaaacaaaaaaagaacaaaaatttaaagacgatttaaagaaagtacaagacaatttacgaaaagaaatcgttaaatctttttcagatggtgatgcaaaatcaatttttgcaattttggataaaaaagaactgattaccgtagaacttgaaaaatggtttgaaaacaacgaacaaaaagacatttattttgacgaaaaattcaaaacgtttactacttattttactggttttcatcaaaacagaaaaaacatgtattcggttgaacccaattctacagcaattgcttatcgattgattcatgaaaatttacctaaatttttagaaaatgctaaagcatttgaaaaaataaaacaagtagaaagtttgcaagttaattttagagaattaatgggggaatttggagatgaagggctaattttcgtaaatgaattagaagaaatgtttcaaatcaattattataatgatgtgctttcacaaaatggaattacaatttataatagtataatttcaggatttaccaaaaatgatataaaatataaaggtctaaatgaatacataaataattacaatcaaaccaaagacaaaaaagaccgtttgccaaaattaaaacaattgtataaacagattttgagtgataggatttcactttcgtttttgcccgatgcttttacggatgggaaacaagttttgaaagccatatttgacttttataaaatcaacttactttcttataccattgaaggacaggaagaaagccaaaatcttttactattaattcgtcagacaattgaaaacctttctagttttgatacccaaaaaatttatctaaaaaatgatacccatttaaccactatttcacaacaagtatttggcgatttttcggtgttttcaactgctttaaattattggtatgaaactaaagtaaatccaaaatttgaaacggaatatagcaaagccaacgaaaaaaaacgagaaattttagataaagccaaagcggtatttacaaaacaagattatttttcaattgcttttttacaagaagtactttcggaatacattcttaccttagatcacacttctgatattgtaaaaaagcattcctccaactgtattgcggattattttaaaaatcattttgtagccaaaaaagaaaatgaaaccgacaaaacctttgattttattgctaatattactgcaaaataccaatgtattcaaggtattttagaaaatgcagaccaatacgaagacgaactcaaacaagaccaaaaattaattgataatttgaaattctttttagatgctattttagaattgttgcattttattaaacctttgcatttaaaatcagaaagcattaccgaaaaagacactgctttttatgatgtgtttgaaaattattacgaagcattgagtttgttgaccccattatataatatggtgcgaaactatgtaacgcaaaagccgtacagcaccgaaaaaataaaattaaattttgaaaatgcacaattattgaatggttgggatgccaataaagaaggtgattacctaactaccattttgaaaaaagacggtaattattttttagccataatggataaaaagcataacaaagcgtttcaaaagtttccagaaggaaaagaaaattatgaaaaaatggtgtataaactattgcctggagtaaataagatgttgccaaaagtatttttttccaataaaaatattgcttacttcaacccatcaa
aagagttattagaaaactataaaaaagagacgcacaaaaaaggagacacattcaatttagaacattgtcatacgttgatcgattttttcaaggactctttaaacaaacatgaagactggaaatactttgattttcaattttctgaaacaaaatcgtatcaagatttgagtggtttttatagagaagtagaacatcaaggctacaaaatcaattttaaaaatatcgattcagaatatattgatggtttggtgaacgaaggtaaattgtttctatttcaaatttacagcaaagatttttcgcctttttccaaagggaaaccgaacatgcacactttgtattggaaagccttatttgaagaacaaaatttgcaaaatgtaatctataaattgaatggacaagccgaaatattttttagaaaagcctctataaaacctaaaaatataatattgcacaaaaagaaaattaaaattgccaaaaagcattttattgataaaaaaacaaaaacatctgaaattgttcctgttcaaacaataaaaaacctcaatatgtactaccaaggaaaaataagtgaaaaagaattaacacaagatgatttaaggtatattgataattttagcattttcaatgaaaaaaataaaacaattgatattataaaagacaaacgatttacggttgataaatttcagtttcatgtgccgattaccatgaactttaaagcaacgggcggaagttatatcaatcaaaccgtattagaatatttgcaaaacaatcccgaagttaagattattggattggatagaggcgaacgccatttggtatatctgacactgatagaccagcaaggaaacatcttgaaacaagaaagtttgaatacaatcaccgattctaaaatctcgacaccttatcataagttgttggataacaaggaaaacgagcgtgacttggctcgaaaaaattggggaacggtggaaaacatcaaagaactcaaagaaggctacatcagtcaagtggtgcataaaattgctacgttgatgctggaagaaaatgccattgtggtaatggaagatttgaattttggatttaaacgtggacgttttaaagtggaaaaacaaatttatcaaaagctggaaaaaatgttgattgacaaattgaattatttggttttaaaagacaaacaacctcaggaattaggcggattgtacaacgcattacaactcaccaataaatttgaaagtttccaaaaaatgggtaaacaatcgggctttttgttttatgtacccgcttggaacacctccaaaatagacccaaccacagggtttgtcaattatttttataccaaatatgaaaatgttgacaaagccaaagccttttttgaaaaatttgaggcgattcgtttcaatgcagaaaagaagtattttgaatttgaagtaaaaaaatatagcgattttaacccaaaagccgaaggcactcaacaagcctggaccatttgcacgtatggcgaacgaatagaaaccaaacgacaaaaagaccaaaacaacaaatttgtaagcactccaattaatctaaccgaaaagatagaagactttttgggtaaaaaccaaattgtttatggtgatggtaattgcatcaaatctcaaattgctagcaaagacgacaaggcttttttgaaaccttattgtattggttcaaaatgactttacaaatgcgaaacagcgaaacaagaacagatatagattatctaatttcgcccgtgatgaatgacaacggaacattttacaacagccgagattatgaaaaattagaaaatccaactttgcccaaagatgccgatgccaacggagcgtatcatattgccaaaaaaggattgatgcttttgaataaaatagaccaagccgacttgacaaaaaaagtggatttatctattagtaacagagattggttgcaatttgtacaaaaaaataaataa 
SEQ ID NO: 29
atggaacaggagtactatttaggactggatatgggaaccggatctgtaggatgggctgttacagattcggaatatcatgtcttgcgtaaacatggaaaagcactatggggagtccgattatttgaaagtgcatcgacagcagaagaacgaagaatgttccgaacatcaagaagaagactagatcgaagaaactggagaattgaaattttacaggaaatttttgcagaggaaataagtaagaaagatccaggatttttcttgcgaatgaaagaaagcaaatattatccagaagataagcgagatatcaatggaaattgtccggaactgccatatgcattatttgttgatgacgattttacagataaagattatcataaaaaatttccgacaatttatcatctcaggaaaatgttgatgaatacagaggagacaccggatatccggttggtgtatctggcaattcatcatatgatgaagcataggggccatttcttgttatctggtgacattaatgagattaaggagttcggaacgacattttcaaaattgttggagaatatcaaaaatgaggaattggattggaatcttgaactgggaaaagaagaatatgctgttgtagaaagtattttaaaagataacatgttaaaccgatccacaaagaaaaccagattaataaaagcattaaaagcaaaatcaatatgtgaaaaggctgtactgaatttattggctggtggaacggtgaaattgagtgatatatttggtcttgaagaattaaatgagacagaaagaccgaagatttcctttgctgataatggatacgatgattatatcggagaagttgaaaatgagctgggagaacaattctatattatagagacggcaaaagcagtgtatgactgggcggtattagttgaaatattgggaaaatatacgtcaatttcagaagcgaaagtagcaacgtatgaaaaacataaatcggatttacaatttttgaaaaagatagttcggaaatatctgacaaaggaggaatataaagatatttttgtaagtacgagtgacaaattgaaaaattactctgcttatataggaatgacgaaaataaatggaaaaaaggttgatttgcagagcaaacggtgcagtaaagaagaattctatgattttattaagaaaaacgtacttaaaaagctagaaggacaacctgaatatgaatatttgaaagaagagctagaaagagaaacatttctaccaaaacaggtgaacagggataatggtgtaataccgtatcagattcatttgtacgagttgaaaaagatattaggaaatttacgggataaaatagacctcattaaagagaacgaagataaactggttcaattatttgaattcagaattccgtattatgttggtccgctgaataagatagatgacggaaaagagggaaaatttacatgggctgtacggaaaagtaatgaaaagatatatccatggaattttgaaaatgtagttgatatagaagcaagtgcagaaaaatttatccggagaatgacaaataagtgtacatatctgatgggcgaagatgtattgccgaaggattcattgctttacagtaaatatatggttttaaatgaattaaataatgtaaagttggatggcgaaaaattatctgtagaattgaaacaacggttgtatacagatgtattttgtaagtatcggaaagtaactgtaaagaagataaaaaattacttgaaatgtgaaggtatcatatccggcaatgtcgaaataactggaattgatggtgattttaaggcatcgttaacggcatatcatgattttaaagaaatcttgacaggaacagaattggctaaaaaggacaaagaaaatattattaccaatatagtattgtttggagatgataaaaagctgctgaaaaagagactgaatcgattatatcctcagattacgccgaatcagttgaagaaaatatgtgcgc
tatcctatacaggctggggaagattttctaaaaagttcttagaagaaataacagctccagatccggaaacgggagaggtatggaatatcattacggcattgtgggaatcgaataataatctgatgcaattattaagtaatgaatatcggtttatggaagaagtcgaaacatacaatatgggaaaacagactaaaacattgtcgtacgaaacagtagagaatatgtatgtttctccatctgtgaaaagacagatatggcagacgctgaaaatcgtgaaagaattagaaaaagtaatgaaagaatctccgaaacgtgtatttattgagatggcgagagaaaagcaagaaagtaagagaaccgaatcgcgtaaaaaacaactaatagatttgtataaggcttgtaaaaatgaagaaaaagattgggtaaaagaactgggagatcaggaagaacagaaattacgaagcgataagttgtacctatattatacgcaaaagggtcgttgtatgtattctggcgaggtaatagaactgaaagacttatgggataatacaaaatatgatattgatcatatatatccacaatctaaaacgatggatgacagtcttaataatcgcgtattggtaaaaaagaaatataatgcaacaaaatcagataagtatccattaaatgaaaatatacgacatgagagaaaaggcttttggaagtcactgttagatggagggtttataagtaaagaaaaatatgaacgcttaataagaaatacagaattgagtccggaagaattagcaggatttattgaaaggcagattgttgaaacgaggcagagtacaaaagctgtagcggaaatattaaagcaagtgtttccggaaagtgaaattgtatatgtcaaagcaggtacggtttcaagattcagaaaagattttgaattactgaaagttcgagaagtgaatgatttgcatcacgcaaaggatgcgtatttaaatattgtagttggtaatagttattatgtgaaatttactaagaatgcatcatggtttataaaagaaaatccgggacgtacttacaacttaaaaaagatgtttacatcaggttggaatattgaacgaaatggagaagttgcatgggaagtcgggaaaaaaggaacaattgtaacggtaaaacaaataatgaataaaaataatatattggtgacaagacaggttcatgaagcgaaaggtgggctgtttgatcagcagattatgaaaaaaggaaaaggtcagattgctataaaggaaactgatgaacgtcttgcatcaatagaaaagtatggaggctataataaagctgccggggcatattttatgctggtagaatctaaagataaaaaaggaaaaacaattcgaacgatagaatttataccattatatttaaagaataaaatcgagtcggatgaatcaatagcattgaactttttagaaaaaggcagaggtttgaaagaaccaaagatactattgaaaaaaattaagattgatacattatttgatgtggacggattcaaaatgtggttgtctggaagaacaggggacagactactatttaaatgtgcaaatcaattgattttggatgagaaaataattgtaacaatgaaaaaaattgtaaagtttattcaaaggagacaagaaaatagagaattaaaattatctgataaagatggaattgataatgaagtacttatggaaatatataacacttttgtggataagttagaaaacacagtgtatagaatacgattatccgaacaggcaaaaacgcttatagataaacaaaaagaatttgaaaggttatcactagaggataaaagtagtactttgtttgaaattttacatatttttcagtgtcaaagtagtgcggccaatttaaaaatgataggcggacctggaaaagcaggaatattagttatgaataataatataagtaagtgtaacaaaatttctattataaatcagtctccaacaggaatt
ttcgaaaatgagattgatttgttaaagat SEQATGAAATCTTTCGATTCATTCACAAATCTTTATTCTCTTTCAAAAACCTTGAAATTTGAGATGAGACCTGTID CGGAAATACCCAAAAAATGCTCGACAATGCAGGAGTATTTGAAAAAGACAAACTAATTCAAAAAAAGTNO: ACGGAAAAACAAAGCCGTATTTCGACAGACTCCACAGAGAATTTATAGAAGAAGCGCTCACGGGGGTA30 GAGCTAATAGGACTAGATGAGAACTTTAGGACACTTGTTGACTGGCAAAAAGATAAGAAAAATAATGTCGCAATGAAAGCGTATGAAAATAGTTTGCAGCGGCTGAGAACGGAAATAGGTAAAATATTTAACCTAAAGGCTGAGGATTGGGTAAAGAACAAATATCCAATATTAGGGCTGAAAAATAAAAATACCGATATTTTATTCGAAGAGGCTGTATTCGGGATATTGAAAGCCCGATATGGAGAAGAAAAAGATACTTTTATAGAAGTAGAGGAAATAGATAAAACCGGCAAATCAAAGATCAATCAAATATCAATTTTCGATAGTTGGAAAGGATTTACAGGATATTTCAAAAAATTTTTTGAAACCAGAAAGAATTTTTACAAAAACGACGGAACTTCTACAGCAATTGCTACAAGGATCATTGATCAAAATCTGAAAAGATTCATAGATAATCTGTCAATAGTTGAAAGTGTGAGACAAAAGGTTGATCTCGCCGAGACAGAAAAATCTTTCAGCATATCTCTATCGCAATTCTTCTCAATAGACTTTTATAACAAGTGTCTCCTTCAAGATGGTATTGATTACTACAACAAGATAATCGGTGGAGAAACTCTCAAAAATGGCGAAAAACTAATAGGTCTCAATGAACTAATAAATCAATATAGGCAGAATAATAAGGATCAGAAAATCCCATTTTTCAAACTTCTTGATAAACAAATTTTGAGTGAAAAGATATTATTTTTGGATGAAATAAAAAATGACACAGAACTGATCGAGGCGCTGAGTCAGTTCGCAAAAACAGCCGAAGAAAAAACAAAAATTGTCAAAAAGCTTTTTGCCGATTTTGTAGAAAATAATTCCAAATACGATCTTGCACAGATTTATATTTCCCAAGAAGCATTCAATACTATATCAAACAAGTGGACAAGCGAAACTGAGACGTTCGCTAAATATCTATTCGAAGCAATGAAGAGTGGAAAACTTGCAAAGTATGAGAAAAAAGATAATAGCTATAAATTTCCTGATTTTATTGCCCTTTCACAGATGAAGAGTGCTTTATTAAGTATCAGCCTTGAGGGACATTTTTGGAAAGAGAAATACTACAAAATTTCAAAATTCCAAGAGAAGACCAATTGGGAGCAGTTTCTTGCAATTTTTCTATACGAGTTTAACTCTCTTTTCAGCGACAAAATAAATACAAAAGATGGAGAAACAAAGCAAGTTGGATACTATCTATTTGCCAAAGACCTGCATAATCTTATCTTAAGTGAGCAGATTGATATTCCAAAAGATTCAAAAGTCACAATAAAAGATTTTGCCGATTCTGTACTCACAATCTACCAAATGGCAAAATATTTTGCGGTAGAAAAAAAACGAGCGTGGCTTGCCGAGTATGAACTAGATTCATTTTATACCCAGCCAGACACAGGCTATTTACAGTTTTATGATAACGCCTACGAGGATATTGTGCAGGTATACAACAAGCTTCGAAACTATCTGACCAAAAAGCCATATAGCGAGGAGAAATGGAAGTTGAATTTTGAAAATTCTACGCTGGCAAATGGATGGGATAAGAACAAAGAATCTGATAATTCAGCAGTTATTCTACAAAAAGGTGGAAAATATTATTTGGGACTGATTACTAAAGGACACAACAAAATTTTTGATGACCGTTTTCAAGAAAAATTTATTGTGGGAATTGAAGGTGGAAAATATGAAAAAATAGTCTATAAATTTTTCCCCG
ACCAGGCAAAAATGTTTCCCAAAGTGTGCTTTTCTGCAAAAGGACTCGAATTTTTTAGACCGTCTGAAGAAATTTTAAGAATTTATAACAATGCAGAGTTTAAAAAAGGAGAAACTTATTCAATAGATAGTATGCAGAAGTTGATTGATTTTTATAAAGATTGCTTGACTAAATATGAAGGCTGGGCATGTTATACCTTTCGGCATCTAAAACCCACAGAAGAATACCAAAACAATATTGGAGAGTTTTTTCGAGATGTTGCAGAGGACGGATACAGGATTGATTTTCAAGGCATTTCAGATCAATATATTCATGAAAAAAACGAGAAAGGCGAACTTCACCTTTTTGAAATCCACAATAAAGATTGGAATTTGGATAAGGCACGAGACGGAAAGTCAAAAACAACACAAAAAAACCTTCATACACTCTATTTCGAATCGCTCTTTTCAAACGATAATGTTGTTCAAAACTTTCCAATAAAACTCAATGGTCAAGCTGAAATTTTTTATAGACCGAAAACGGAAAAAGACAAATTAGAATCAAAAAAAGATAAGAAAGGGAATAAAGTGATTGACCATAAACGCTATAGTGAGAATAAGATTTTTTTTCATGTTCCTCTCACACTAAACCGCACTAAAAATGACTCATATCGCTTTAATGCTCAAATCAACAACTTTCTCGCAAATAATAAAGATATCAACATCATCGGTGTAGATAGGGGAGAAAAGCATTTAGTCTATTATTCGGTGATTACACAAGCTAGTGACATCTTAGAAAGTGGCTCACTAAATGAGCTAAATGGCGTGAATTATGCTGAAAAACTGGGAAAAAAGGCAGAAAATCGAGAACAAGCACGCAGAGACTGGCAAGACGTACAAGGGATCAAAGACCTCAAGAAAGGATATATTTCACAGGTGGTGCGAAAGCTTGCTGATTTAGCAATTAAACACAATGCCATTATCATTCTTGAAGATTTGAATATGAGATTTAAACAAGTTCGGGGCGGTATCGAAAAATCCATTTATCAACAGTTAGAAAAAGCACTGATAGATAAATTAAGCTTTCTTGTAGACAAAGGTGAAAAAAATCCCGAGCAAGCAGGACATCTTCTGAAAGCATATCAGCTTTCGGCCCCATTTGAGACATTTCAAAAAATGGGCAAACAGACGGGTATAATCTTTTATACACAAGCTTCGTATACCTCAAAAAGTGACCCTGTAACAGGTTGGCGACCACACCTGTATCTCAAATATTTCAGTGCCAAAAAAGCAAAAGACGATATTGCAAAGTTTACAAAAATAGAATTTGTAAACGATAGGTTTGAGCTTACCTATGATATAAAGGACTTTCAGCAAGCAAAAGAATATCCAAATAAAACTGTTTGGAAAGTTTGCTCAAATGTAGAGAGATTCAGGTGGGACAAAAACCTCAATCAAAACAAAGGCGGATATACTCACTACACAAATATAACTGAGAATATCCAAGAGCTTTTTACAAAATATGGAATTGATATCACAAAAGATTTGCTCACACAGATTTCTACAATTGATGAAAAACAAAATACCTCATTTTTTAGAGATTTTATTTTTTATTTCAACCTTATTTGCCAAATCAGAAATACCGATGATTCTGAGATTGCTAAAAAGAATGGGAAAGATGATTTTATACTGTCACCTGTTGAGCCGTTTTTCGATAGCCGAAAAGACAATGGAAATAAACTTCCTGAGAATGGAGATGATAACGGCGCGTATAACATAGCAAGAAAAGGGATTGTCATACTCAACAAAATCTCACAATATTCAGAGAAAAACGAAAATTGCGAGAAAATGAAATGGGGGGATTTGTATGTATCAAACATTGACTGGGACAATTTTGTAACCCAAGCTAATGCACGGCATTAA SEQATGATTATCTTATATATTAGTACCTCGAATATGAACATGGAAGGAGTATTTATGGAAAATTTTAAAAACTID 
TGTATCCAATAAACAAAACACTTCGATTTGAATTAAGACCCTATGGAAAAACATTGGAAAATTTTAAAANO:AATCCGGACTTTTAGAAAAAGATGCCTTTAAGGCAAATAGTAGACGAAGTATGCAAGCTATAATCGATG 31AAAAATTCAAAGAGACTATCGAAGAACGCTTAAAGTACACTGAATTCAGTGAATGTGATCTTGGAAACATGACATCAAAAGATAAAAAAATAACTGATAAAGCAGCTACAAATTTAAAAAAGCAAGTTATCTTATCTTTTGACGATGAAATATTTAATAATTACCTAAAACCTGATAAAAATATTGACGCATTATTTAAAAATGATCCTTCAAATCCTGTAATCTCTACATTTAAAGGTTTTACGACATATTTTGTGAATTTTTTTGAAATTCGAAAACATATTTTCAAGGGAGAATCATCAGGCTCAATGGCATACCGAATTATAGATGAAAACCTGACAACATACTTGAATAATATTGAAAAAATAAAAAAACTGCCAGAAGAATTAAAATCACAGCTAGAAGGCATTGATCAGATTGATAAACTTAATAATTATAATGAGTTCATTACACAGTCAGGTATAACACACTATAATGAAATCATCGGCGGTATATCAAAATCAGAGAATGTCAAAATACAGGGAATTAATGAAGGAATTAATCTATACTGTCAGAAGAACAAAGTTAAACTTCCTCGACTGACTCCGCTATACAAAATGATATTATCAGACAGAGTTTCCAACTCTTTTGTATTAGACACTATTGAAAATGACACAGAATTAATTGAAATGATAAGTGATTTGATTAATAAGACTGAGATTTCGCAAGATGTTATAATGTCAGATATTCAAAATATTTTCATAAAATACAAACAACTTGGTAATTTGCCGGGTATCTCATATTCTTCAATAGTTAATGCTATTTGCTCGGATTATGACAACAATTTCGGAGATGGGAAGCGAAAAAAATCTTACGAAAATGATCGCAAAAAGCATTTGGAGACTAATGTATACTCCATAAATTATATTTCTGAATTGCTTACAGATACCGATGTTTCATCAAATATCAAGATGAGATATAAAGAGCTTGAGCAAAATTATCAGGTTTGCAAAGAAAATTTTAATGCCACAAACTGGATGAATATTAAAAATATAAAACAATCTGAAAAAACAAACCTTATTAAAGATTTGTTAGATATACTTAAATCGATTCAACGTTTCTATGATTTGTTTGATATTGTTGACGAAGATAAAAATCCAAGTGCTGAATTTTATACCTGGTTATCAAAAAATGCTGAAAAGCTTGACTTTGAATTCAATTCTGTATATAACAAGTCACGAAACTATCTCACCAGGAAACAATACTCTGATAAAAAAATCAAGCTGAATTTTGATTCTCCAACATTGGCCAAAGGGTGGGATGCTAACAAAGAAATAGATAACTCCACGATTATAATGCGTAAATTTAATAATGACAGAGGCGATTATGATTACTTCCTTGGCATATGGAATAAATCCACACCTGCAAATGAAAAAATAATCCCACTGGAGGATAATGGATTATTCGAAAAAATGCAATATAAGCTGTATCCAGATCCTAGTAAGATGTTACCGAAACAATTTCTATCAAAAATATGGAAGGCAAAGCATCCTACGACACCTGAATTTGATAAAAAATATAAAGAGGGAAGACATAAAAAAGGTCCTGATTTCGAAAAAGAATTCCTGCATGAATTGATTGATTGCTTCAAACATGGTCTTGTTAATCACGATGAAAAATATCAGGATGTTTTTGGCTTCAATCTCCGTAACACTGAAGATTATAATTCATATACAGAGTTTCTCGAAGATGTGGAAAGATGCAATTACAATCTTTCATTTAACAAAATTGCTGATACTTCAAACCTTATTAATGATGGGAAATTGTATGTATTTCAGATATGGTCAAAAGACTTTTCTATTGATTCAAAAGGTACT
AAAAACTTGAATACAATCTATTTTGAATCACTATTTTCAGAAGAAAACATGATAGAAAAAATGTTCAAGCTTTCTGGAGAGGCTGAGATATTCTATCGACCAGCATCGTTGAATTATTGTGAAGATATCATAAAAAAAGGTCATCACCATGCAGAATTAAAAGATAAGTTTGACTATCCTATAATAAAAGATAAGCGATATTCACAAGATAAGTTTTTCTTTCATGTGCCAATGGTTATAAATTATAAATCTGAGAAACTGAATTCCAAAAGCCTTAACAACCGAACAAATGAAAACCTGGGACAGTTTACACATATTATAGGTATAGACAGGGGCGAGCGGCACTTGATTTATTTAACTGTTGTTGATGTTTCCACTGGTGAAATCGTTGAACAGAAACATCTGGACGAAATTATCAATACTGATACCAAGGGAGTTGAACACAAAACCCATTATTTGAATAAATTGGAAGAAAAATCTAAAACAAGAGATAACGAGCGTAAATCATGGGAAGCTATTGAAACTATCAAAGAATTAAAAGAAGGCTATATTTCTCATGTAATTAATGAAATACAAAAGCTGCAAGAAAAATATAATGCCTTAATCGTAATGGAAAATCTTAACTATGGGTTCAAAAACTCACGAATCAAAGTTGAAAAACAGGTTTATCAAAAATTCGAGACAGCATTGATTAAAAAGTTCAATTATATTATTGATAAAAAAGATCCAGAAACCTATATACATGGTTACCAGCTTACAAATCCTATTACCACTCTGGATAAGATTGGAAATCAATCTGGAATAGTGCTGTATATTCCTGCGTGGAATACTTCTAAGATAGATCCCGTCACAGGATTTGTAAACCTTCTGTACGCAGATGATTTGAAGTATAAAAATCAGGAGCAGGCCAAATCATTCATTCAGAAAATAGACAACATATATTTTGAAAATGGAGAGTTTAAATTTGATATTGATTTTTCCAAATGGAATAATCGCTACTCAATAAGTAAAACTAAATGGACGTTAACAAGTTATGGGACTCGCATCCAGACATTTAGAAATCCCCAGAAAAACAATAAGTGGGATTCTGCTGAATATGATTTGACAGAAGAGTTTAAATTAATTTTAAATATAGACGGAACGTTAAAGTCACAGGACGTAGAAACATACAAAAAATTCATGTCTTTATTTAAACTAATGCTACAGCTTCGAAACTCTGTTACAGGAACCGACATTGATTATATGATCTCTCCTGTCACTGATAAAACAGGAACACATTTCGATTCAAGAGAAAATATTAAAAATCTTCCTGCCGATGCAGATGCCAATGGTGCCTACAACATTGCGCGCAAAGGAATAATGGCTATTGAAAATATAATGAACGGTATAAGCGATCCACTAAAAATAAGCAACGAAGACTATTTAAAGTATATTCAGAATCAACAGGAATAA SEQATGACCCAATTTGAAGGTTTTACCAATTTATACCAAGTTTCGAAGACCCTTCGTTTTGAACTGATTCCCCID AAGGAAAAACACTCAAACATATCCAGGAGCAAGGGTTCATTGAGGAGGATAAAGCTCGCAATGACCATNO:TACAAAGAGTTAAAACCAATCATTGACCGCATCTATAAGACTTATGCTGATCAATGTCTCCAACTGGTAC32 
AGCTTGACTGGGAGAATCTATCTGCAGCCATAGACTCCTATCGTAAGGAAAAAACCGAAGAAACACGAAATGCGCTGATTGAGGAGCAAGCAACATATAGAAATGCGATTCATGACTACTTTATAGGTCGGACGGATAATCTGACAGATGCCATAAATAAGCGCCATGCTGAAATCTATAAAGGACTTTTTAAAGCTGAACTTTTCAATGGAAAAGTTTTAAAGCAATTAGGGACCGTAACCACGACAGAACATGAAAATGCTCTACTCCGTTCGTTTGACAAATTTACGACCTATTTTTCCGGCTTTTATGAAAACCGAAAAAATGTCTTTAGCGCTGAAGATATCAGCACGGCAATTCCCCATCGAATCGTCCAGGACAATTTCCCTAAATTTAAGGAAAACTGCCATATTTTTACAAGATTGATAACCGCAGTTCCTTCTTTGCGGGAGCATTTTGAAAATGTCAAAAAGGCCATTGGAATCTTTGTTAGTACGTCTATTGAAGAAGTCTTTTCCTTTCCCTTTTATAATCAACTTCTAACCCAAACGCAAATTGATCTTTATAATCAACTTCTCGGCGGCATATCTAGGGAAGCAGGCACAGAAAAAATCAAGGGACTTAATGAAGTTCTCAATCTGGCTATCCAAAAAAATGATGAAACAGCCCATATAATCGCGTCCCTGCCGCATCGTTTTATTCCTCTTTTTAAACAAATTCTTTCCGATCGAAATACGTTATCCTTTATTTTGGAAGAATTCAAAAGCGATGAGGAAGTCATCCAATCCTTCTGCAAATATAAAACCCTCTTGAGAAACGAAAATGTACTGGAGACTGCAGAAGCCCTTTTCAATGAATTAAATTCCATTGATTTGACTCATATCTTTATTTCCCATAAAAAGTTAGAAACCATCTCTTCAGCGCTTTGTGACCATTGGGATACCTTGCGCAATGCACTTTACGAAAGACGGATTTCTGAACTCACTGGCAAAATAACAAAAAGTGCCAAAGAAAAAGTTCAAAGGTCATTAAAACATGAGGATATAAATCTCCAAGAAATTATTTCTGCTGCAGGAAAAGAACTATCAGAAGCATTCAAACAAAAAACAAGTGAAATTCTTTCCCATGCCCATGCTGCACTTGACCAGCCTCTTCCCACAACATTAAAAAAACAGGAAGAAAAAGAAATCCTCAAATCACAGCTCGATTCGCTTTTAGGCCTTTATCATCTTCTTGATTGGTTTGCTGTCGATGAAAGCAATGAAGTCGACCCAGAATTCTCAGCACGGCTGACAGGCATTAAACTAGAAATGGAACCAAGCCTTTCGTTTTATAATAAAGCAAGAAATTATGCGACAAAAAAGCCCTATTCGGTGGAAAAATTTAAATTGAATTTTCAAATGCCAACCCTTGCCTCTGGTTGGGATGTCAATAAAGAAAAAAATAATGGAGCTATTTTATTCGTAAAAAATGGTCTCTATTACCTTGGTATCATGCCTAAACAGAAGGGGCGCTATAAAGCCCTGTCTTTTGAGCCGACAGAAAAAACATCAGAAGGATTCGATAAGATGTACTATGACTACTTCCCAGATGCCGCAAAAATGATTCCTAAGTGTTCCACTCAGCTAAAGGCTGTAACCGCTCATTTTCAAACTCATACCACCCCCATTCTTCTCTCAAATAATTTCATTGAACCTCTTGAAATCACAAAAGAAATTTATGACCTGAACAATCCTGAAAAGGAGCCTAAAAAGTTTCAAACGGCTTATGCAAAGAAGACAGGCGATCAAAAAGGCTATAGAGAAGCGCTTTGCAAATGGATTGACTTTACGCGGGATTTTCTCTCTAAATATACGAAAACAACTTCAATCGATTTATCTTCACTCCGCCCTTCTTCGCAATATAAAGATTTAGGGGAATATTACGCCGAACTGAATCCGCTTCTCTATCATATCTCCTTCCAACGAATTGCTGAAAAGGAAATCATGGATGCTGTAGAA
ACGGGAAAATTGTATCTGTTCCAAATCTACAATAAGGATTTTGCGAAGGGCCATCACGGGAAACCAAATCTCCACACCCTGTATTGGACAGGTCTCTTCAGTCCTGAAAACCTTGCGAAAACCAGCATCAAACTTAATGGTCAAGCAGAATTGTTCTATCGACCTAAAAGCCGCATGAAGCGGATGGCCCATCGTCTTGGGGAAAAAATGCTGAACAAAAAACTAAAGGACCAGAAGACACCGATTCCAGATACCCTCTACCAAGAACTGTACGATTATGTCAACCACCGGCTAAGCCATGATCTTTCCGATGAAGCAAGGGCCCTGCTTCCAAATGTTATCACCAAAGAAGTCTCCCATGAAATTATAAAGGATCGGCGGTTTACTTCCGATAAATTTTTCTTCCATGTTCCCATTACACTGAATTATCAAGCAGCCAATAGTCCCAGTAAATTCAACCAGCGTGTCAATGCCTACCTTAAGGAGCATCCGGAAACGCCCATCATTGGTATCGATCGTGGAGAACGCAATCTAATCTATATTACCGTCATTGACAGTACTGGGAAAATTTTGGAGCAGCGTTCCCTGAATACCATCCAGCAATTTGACTACCAAAAAAAATTGGACAACAGGGAAAAAGAGCGTGTTGCCGCCCGTCAAGCCTGGTCCGTCGTCGGAACGATCAAAGACCTTAAACAAGGCTACTTGTCACAGGTCATCCATGAAATTGTAGACCTGATGATTCATTACCAAGCTGTTGTCGTCCTTGAAAACCTCAACTTCGGATTTAAATCAAAACGGACAGGCATTGCCGAAAAAGCAGTCTACCAACAATTTGAAAAGATGCTAATAGATAAACTCAACTGTTTGGTTCTCAAAGATTATCCTGCTGAGAAAGTGGGAGGCGTCTTAAACCCGTATCAACTTACAGATCAGTTCACGAGCTTTGCAAAAATGGGCACGCAAAGCGGCTTCCTTTTCTATGTACCGGCCCCTTATACCTCAAAGATTGATCCCCTGACTGGTTTTGTCGATCCCTTTGTATGGAAGACCATTAAAAATCATGAAAGTCGGAAGCATTTCCTAGAAGGATTTGATTTCCTGCATTATGATGTCAAAACAGGTGATTTTATCCTCCATTTTAAAATGAATCGGAATCTCTCTTTCCAGAGAGGGCTTCCTGGCTTCATGCCAGCTTGGGATATTGTTTTCGAAAAGAATGAAACCCAATTTGATGCAAAAGGGACGCCCTTCATTGCAGGAAAACGAATTGTTCCTGTAATCGAAAATCATCGTTTTACGGGTCGTTACAGAGACCTCTATCCCGCTAATGAACTCATTGCCCTTCTGGAAGAAAAAGGCATTGTCTTTAGAGACGGAAGTAATATATTACCCAAACTTTTAGAAAATGATGATTCTCATGCAATTGATACGATGGTCGCCTTGATTCGCAGTGTACTCCAAATGAGAAACAGCAATGCCGCAACGGGGGAAGACTACATCAACTCTCCCGTTAGGGATCTGAACGGGGTGTGTTTCGACAGTCGATTCCAAAATCCAGAATGGCCAATGGATGCGGATGCCAACGGAGCTTATCATATTGCCTTAAAAGGGCAGCTTCTTCTGAACCACCTCAAAGAAAGCAAAGATCTGAAATTACAAAACGGCATCAGCAACCAAGATTGGCTGGCCTACATTCAGGAACTGAGAAACTGA SEQATGGCCGTCAAATCCATCAAAGTGAAACTTCGTCTCGACGATATGCCGGAGATTCGGGCCGGTCTATGG IDAAACTTCATAAGGAAGTCAATGCGGGGGTTCGATATTACACGGAATGGCTCAGTCTTCTCCGTCAAGAGNO: AACTTGTATCGAAGAAGTCCGAATGGGGACGGAGAGCAAGAATGTGATAAGACTGCAGAAGAATGCAA33 
AGCCGAATTGTTGGAGCGGCTGCGCGCGCGTCAAGTGGAGAATGGACACCGTGGTCCGGCGGGATCGGACGATGAATTGCTGCAGTTGGCGCGTCAACTCTATGAGTTGTTGGTTCCGCAGGCGATAGGTGCGAAAGGCGACGCGCAGCAAATTGCCCGCAAATTTTTGAGCCCCTTGGCCGACAAGGACGCAGTTGGTGGGCTTGGAATCGCGAAGGCGGGGAACAAACCGCGGTGGGTTCGCATGCGCGAAGCGGGGGAACCAGGCTGGGAAGAGGAGAAGGAGAAGGCTGAGACGAGGAAATCTGCGGATCGGACTGCGGATGTTTTGCGCGCGCTCGCGGATTTTGGGTTAAAGCCACTGATGCGCGTATACACCGATTCTGAGATGTCATCGGTGGAGTGGAAACCGCTTCGGAAGGGACAAGCCGTTCGGACGTGGGATAGGGACATGTTCCAACAAGCTATCGAACGGATGATGTCGTGGGAGTCGTGGAATCAGCGCGTTGGGCAAGAGTACGCGAAACTCGTAGAACAAAAAAATCGATTTGAGCAGAAGAATTTCGTCGGCCAGGAACATCTGGTCCATCTCGTCAATCAGTTGCAACAAGATATGAAAGAAGCATCGCCCGGACTCGAATCGAAAGAGCAAACCGCGCACTATGTGACGGGACGGGCATTGCGCGGATCGGACAAGGTATTTGAGAAGTGGGGGAAACTCGCCCCCGATGCACCTTTCGATTTGTACGACGCCGAAATCAAGAATGTGCAGAGACGTAACACGAGACGATTCGGATCACATGACTTGTTCGCAAAATTGGCAGAGCCAGAGTATCAGGCCCTGTGGCGCGAAGATGCTTCGTTTCTCACGCGTTACGCGGTGTACAACAGCATCCTTCGCAAACTGAATCACGCCAAAATGTTCGCGACGTTTACTTTGCCGGATGCAACGGCGCACCCGATTTGGACTCGCTTCGATAAATTGGGTGGGAATTTGCACCAGTACACCTTTTTGTTCAACGAATTTGGAGAACGCAGGCACGCGATTCGTTTTCACAAGCTATTGAAAGTCGAGAATGGTGTCGCAAGAGAAGTTGATGATGTCACCGTGCCCATTTCAATGTCAGAGCAATTGGATAATCTGCTTCCCAGAGATCCCAATGAACCGATTGCGCTATATTTTCGAGATTACGGAGCCGAACAGCATTTCACAGGTGAATTTGGTGGCGCGAAGATCCAGTGCCGCCGGGATCAGCTGGCTCATATGCACCGACGCAGAGGGGCGAGGGATGTTTATCTCAATGTCAGCGTACGTGTGCAGAGTCAGTCTGAGGCGCGGGGAGAACGTCGCCCGCCGTATGCGGCAGTATTTCGTCTGGTCGGGGACAACCATCGCGCGTTTGTCCATTTCGATAAACTATCGGATTATCTTGCGGAACATCCGGATGATGGGAAGCTCGGGTCGGAGGGGTTGCTTTCCGGGCTGCGGGTGATGAGTGTCGATCTCGGCCTTCGCACATCTGCATCGATTTCCGTTTTTCGCGTTGCCCGGAAGGACGAGTTGAAGCCGAACTCAAAAGGTCGTGTACCGTTTTTCTTTCCGATAAAAGGGAATGACAATCTCGTCGCGGTTCATGAGCGATCACAACTCTTGAAGCTGCCTGGCGAAACGGAGTCGAAGGACCTGCGTGCTATCCGAGAAGAACGCCAACGGACATTGCGGCAGTTGCGGACGCAACTGGCGTATTTGCGGCTGCTCGTGCGGTGTGGGTCGGAAGATGTGGGGCGGCGTGAACGGAGTTGGGCAAAGCTTATCGAGCAGCCGGTGGATGCGGCCAATCACATGACACCGGATTGGCGCGAGGCTTTTGAAAACGAACTTCAGAAGCTTAAGTCACTCCATGGTATCTGTAGCGACAAGGAATGGATGGATGCTGTCTACGAGAGCGTTCGCCGCGTGTGGCGTCACATGGGCAAACAGGTTCGCGATTGGCGAAAGG
ACGTACGAAGCGGAGAGCGGCCCAAGATTCGCGGCTATGCGAAAGACGTGGTCGGTGGAAACTCGATTGAGCAAATCGAGTATCTGGAACGTCAGTACAAGTTCCTCAAGAGTTGGAGCTTCTTTGGTAAGGTGTCGGGACAAGTGATTCGTGCGGAGAAGGGATCTCGTTTTGCGATCACGCTGCGCGAACACATTGATCACGCGAAGGAAGATCGGCTGAAGAAATTGGCGGATCGCATCATTATGGAGGCTCTCGGCTATGTGTACGCGTTGGATGAGCGCGGCAAAGGAAAGTGGGTTGCGAAGTATCCGCCGTGCCAGCTCATCCTGCTGGAGGAATTGAGCGAGTACCAGTTCAATAACGACAGGCCTCCGAGCGAAAACAACCAGTTGATGCAATGGAGTCATCGCGGCGTGTTCCAGGAGTTGATAAATCAGGCCCAAGTCCATGATTTACTCGTTGGGACGATGTATGCAGCGTTCTCGTCGCGATTCGACGCGCGAACTGGGGCACCGGGTATCCGCTGTCGCCGGGTTCCGGCGCGTTGCACCCAGGAGCACAATCCAGAACCATTTCCTTGGTGGCTGAACAAGTTTGTGGTGGAACATACGTTGGATGCTTGTCCCCTACGCGCAGACGACCTCATCCCAACGGGTGAAGGAGAGATTTTTGTCTCGCCGTTCAGCGCGGAGGAGGGGGACTTTCATCAGATTCACGCCGACCTGAATGCGGCGCAAAATCTGCAGCAGCGACTCTGGTCTGATTTTGATATCAGTCAAATTCGGTTGCGGTGTGATTGGGGTGAAGTGGACGGTGAACTCGTTCTGATCCCAAGGCTTACAGGAAAACGAACGGCGGATTCATATAGCAACAAGGTGTTTTATACCAATACAGGTGTCACCTATTATGAGCGAGAGCGGGGGAAGAAGCGGAGAAAGGTTTTCGCGCAAGAGAAATTGTCGGAGGAAGAGGCGGAGTTGCTCGTGGAAGCAGACGAGGCGAGGGAGAAATCGGTCGTTTTGATGCGTGATCCGTCTGGCATCATCAATCGGGGAAATTGGACCAGGCAAAAGGAATTTTGGTCGATGGTGAACCAGCGGATCGAAGGATACTTGGTCAAGCAGATTCGCTCGCGCGTTCCATTACAAGATAGTGCGTGTGAAAACACGGGGGATATTTAA SEQATGGCGACACGCAGTTTTATTTTAAAAATTGAACCAAATGAAGAAGTTAAAAAGGGATTATGGAAGACG IDCATGAGGTATTGAATCATGGAATTGCCTACTACATGAATATTCTGAAACTAATTAGACAGGAAGCTATTTNO: 
ATGAACATCATGAACAAGATCCTAAAAATCCGAAAAAAGTTTCAAAAGCAGAAATACAAGCCGAGTTA34TGGGATTTTGTTTTAAAAATGCAAAAATGTAATAGTTTTACACATGAAGTTGACAAAGATGTTGTTTTTAACATCCTGCGTGAACTATATGAAGAGTTGGTCCCTAGTTCAGTCGAGAAAAAGGGTGAAGCCAATCAATTATCGAATAAGTTTCTGTACCCGCTAGTTGATCCGAACAGTCAAAGTGGGAAAGGGACGGCATCATCCGGACGTAAACCTCGGTGGTATAATTTAAAAATAGCAGGCGACCCATCGTGGGAGGAAGAAAAGAAAAAATGGGAAGAGGATAAAAAGAAAGATCCCCTTGCTAAAATCTTAGGTAAGTTAGCAGAATATGGGCTTATTCCGCTATTTATTCCATTTACTGACAGCAACGAACCAATTGTAAAAGAAATTAAATGGATGGAAAAAAGTCGTAATCAAAGTGTCCGGCGACTTGATAAGGATATGTTTATCCAAGCATTAGAGCGTTTTCTTTCATGGGAAAGCTGGAACCTTAAAGTAAAGGAAGAGTATGAAAAAGTTGAAAAGGAACACAAAACACTAGAGGAAAGGATAAAAGAGGACATTCAAGCATTTAAATCCCTTGAACAATATGAAAAAGAACGGCAGGAGCAACTTCTTAGAGATACATTGAATACAAATGAATACCGATTAAGCAAAAGAGGATTACGTGGTTGGCGTGAAATTATCCAAAAATGGCTAAAGATGGATGAAAATGAACCATCAGAAAAATATTTAGAAGTATTTAAAGATTATCAACGGAAACATCCACGAGAAGCCGGGGACTATTCTGTCTATGAATTTTTAAGCAAGAAAGAAAATCATTTTATTTGGCGAAATCATCCTGAATATCCTTATTTGTATGCTACATTTTGTGAAATTGACAAAAAAAAGAAAGACGCTAAGCAACAGGCAACTTTTACTTTGGCTGACCCGATTAACCATCCGTTATGGGTACGATTTGAAGAAAGAAGCGGTTCGAACTTAAACAAATATCGAATTTTAACAGAGCAATTACACACTGAAAAGTTAAAAAAGAAATTAACAGTTCAACTTGATCGTTTAATTTATCCAACTGAATCCGGCGGTTGGGAGGAAAAAGGTAAAGTAGATATCGTTTTGTTGCCGTCAAGACAATTTTATAATCAAATCTTCCTTGATATAGAAGAAAAGGGGAAACATGCTTTTACTTATAAGGATGAAAGTATTAAATTCCCCCTTAAAGGTACACTTGGTGGTGCAAGAGTGCAGTTTGACCGTGACCATTTGCGGAGATATCCGCATAAAGTAGAATCAGGAAATGTTGGACGGATTTATTTTAACATGACAGTAAATATTGAACCAACTGAGAGCCCTGTTAGTAAGTCTTTGAAAATACATAGGGACGATTTCCCCAAGTTCGTTAATTTTAAACCGAAAGAGCTCACCGAATGGATAAAAGATAGTAAAGGGAAAAAATTAAAAAGTGGTATAGAATCCCTTGAAATTGGTCTACGGGTGATGAGTATCGACTTAGGTCAACGTCAAGCGGCTGCTGCATCGATTTTTGAAGTAGTTGATCAGAAACCGGATATTGAAGGGAAGTTATTTTTTCCAATCAAAGGAACTGAGCTTTATGCTGTTCACCGGGCAAGTTTTAACATTAAATTACCGGGTGAAACATTAGTAAAATCACGGGAAGTATTGCGGAAAGCTCGGGAGGACAACTTAAAATTAATGAATCAAAAGTTAAACTTTCTAAGAAATGTTCTACATTTCCAACAGTTTGAAGATATCACAGAAAGAGAGAAGCGTGTAACTAAATGGATTTCTAGACAAGAAAATAGTGATGTTCCTCTTGTATATCAAGATGAGCTAATTCAAATTCGTGAATTAATGTATAAACCCTATAAAGATTGGGTTGCCTTTTTAAAACAACTCCATAAAC
GGCTAGAAGTCGAGATTGGCAAAGAGGTTAAGCATTGGCGAAAATCATTAAGTGACGGGAGAAAAGGTCTTTACGGAATCTCCCTAAAAAATATTGATGAAATTGATCGAACAAGGAAATTCCTTTTAAGATGGAGCTTACGTCCAACAGAACCTGGGGAAGTAAGACGCTTGGAACCAGGACAGCGTTTTGCGATTGATCAATTAAACCACCTAAATGCATTAAAAGAAGATCGATTAAAAAAGATGGCAAATACGATTATCATGCATGCCTTAGGTTACTGTTATGATGTAAGAAAGAAAAAGTGGCAGGCAAAAAATCCAGCATGTCAAATTATTTTATTTGAAGATTTATCTAACTACAATCCTTACGAGGAAAGGTCCCGTTTTGAAAACTCAAAACTGATGAAGTGGTCACGGAGAGAAATTCCACGACAAGTCGCCTTACAAGGTGAAATTTACGGATTACAAGTTGGGGAAGTAGGTGCCCAATTCAGTTCAAGATTCCATGCGAAAACCGGGTCGCCGGGAATTCGTTGCAGTGTTGTAACGAAAGAAAAATTGCAGGATAATCGCTTTTTTAAAAATTTACAAAGAGAAGGACGACTTACTCTTGATAAAATCGCAGTTTTAAAAGAAGGAGACTTATATCCAGATAAAGGTGGAGAAAAGTTTATTTCTTTATCAAAGGATCGAAAGTTGGTAACTACGCATGCTGATATTAACGCGGCCCAAAATTTACAGAAGCGTTTTTGGACAAGAACACATGGATTTTATAAAGTTTACTGCAAAGCCTATCAGGTTGATGGACAAACTGTTTATATTCCGGAGAGCAAGGACCAAAAACAAAAAATAATTGAAGAATTTGGGGAAGGCTATTTTATTTTAAAAGATGGTGTATATGAATGGGGTAATGCGGGGAAACTAAAAATTAAAAAAGGTTCCTCTAAACAATCATCGAGTGAATTAGTAGATTCGGACATACTGAAAGATTCATTTGATTTAGCAAGTGAACTTAAGGGAGAGAAACTCATGTTATATCGAGATCCGAGTGGAAACGTATTTCCTTCCGACAAGTGGATGGCAGCAGGAGTATTTTTTGGCAAATTAGAAAGAATATTGATTTCTAAGTTAACAAATCAATACTCAATATCAACAATAGAAGATGATTCTTCAAAACAATCAATGTAA SEQATGCCCACCCGCACCATCAATCTGAAACTTGTTCTTGGGAAAAATCCTGAAAACGCAACATTGCGACGC IDGCCCTATTTTCGACACACCGTTTGGTTAACCAAGCGACGAAACGTATTGAGGAATTCTTGTTGCTGTGTCNO: GTGGAGAAGCCTACAGAACAGTGGATAATGAGGGGAAGGAAGCCGAGATTCCACGTCATGCAGTCCAA35 
GAAGAAGCTCTTGCCTTTGCCAAAGCTGCTCAACGCCACAACGGCTGTATATCCACCTATGAAGACCAAGAGATTCTTGATGTACTGCGGCAACTGTACGAACGTCTTGTTCCTTCGGTCAACGAAAACAACGAGGCAGGCGATGCTCAAGCTGCTAACGCCTGGGTCAGTCCGCTCATGTCGGCAGAAAGCGAAGGAGGCTTGTCGGTCTACGACAAGGTGCTTGATCCACCGCCGGTTTGGATGAAGCTTAAAGAAGAAAAGGCTCCAGGATGGGAAGCCGCTTCTCAAATTTGGATTCAGAGTGATGAGGGACAGTCGTTACTTAATAAGCCAGGTAGCCCTCCCCGCTGGATTCGAAAACTGCGATCTGGGCAACCGTGGCAAGATGATTTCGTCAGTGACCAAAAGAAAAAGCAAGATGAGCTGACCAAAGGGAACGCACCACTTATAAAACAACTCAAAGAAATGGGGTTGTTGCCTCTTGTTAACCCATTTTTTAGACATCTTCTTGACCCTGAAGGTAAAGGCGTGAGTCCATGGGACCGTCTTGCTGTACGCGCTGCAGTGGCTCACTTTATCTCCTGGGAAAGTTGGAATCATAGAACACGTGCAGAATACAATTCCTTGAAACTACGGCGAGACGAGTTTGAGGCAGCATCCGACGAATTCAAAGACGATTTTACTTTGCTCCGACAATATGAAGCCAAACGCCATAGTACATTGAAAAGCATCGCGCTGGCCGACGATTCGAACCCTTACCGGATTGGAGTACGTTCTCTGCGTGCCTGGAACCGCGTTCGTGAAGAATGGATAGACAAGGGTGCAACAGAAGAACAACGCGTGACCATATTGTCAAAGCTTCAAACACAACTTCGGGGAAAATTCGGCGATCCCGATCTGTTCAACTGGCTAGCTCAGGATAGGCATGTCCATTTGTGGTCTCCTCGGGACAGCGTGACACCATTGGTTCGCATCAATGCGGTAGATAAAGTTCTGCGTCGACGAAAACCGTATGCATTGATGACCTTTGCCCATCCCCGCTTCCACCCTCGATGGATACTGTACGAGGCTCCAGGAGGAAGCAATCTCCGTCAATATGCATTGGATTGTACAGAAAACGCTCTACACATCACGTTGCCTTTGCTTGTCGACGATGCGCACGGAACCTGGATTGAAAAAAAGATCAGGGTGCCGCTGGCACCATCCGGACAAATTCAAGATTTAACTCTGGAAAAACTTGAGAAGAAAAAAAATCGTTTATACTACCGTTCCGGTTTTCAGCAGTTTGCCGGCTTGGCTGGCGGAGCTGAGGTTCTTTTCCACAGACCCTATATGGAACACGACGAACGCAGCGAGGAGTCTCTTTTGGAACGTCCGGGAGCCGTTTGGTTCAAATTGACCCTGGATGTGGCAACACAGGCTCCCCCGAACTGGCTTGATGGTAAGGGCCGTGTCCGTACACCGCCGGAGGTACATCATTTTAAAACCGCATTGTCGAATAAAAGCAAACATACACGTACGCTGCAGCCGGGTCTCCGTGTCTTGTCAGTAGACTTGGGCATGCGAACATTCGCCTCCTGCTCAGTATTTGAACTCATCGAGGGAAAGCCTGAGACAGGCCGTGCCTTCCCTGTTGCCGATGAGAGATCAATGGACAGCCCGAATAAACTGTGGGCCAAGCATGAACGTAGTTTTAAACTGACGCTCCCCGGCGAAACCCCTTCTCGAAAGGAAGAGGAAGAGCGTAGCATAGCAAGAGCGGAAATTTATGCACTGAAACGCGACATACAACGCCTCAAAAGCCTACTCCGCTTAGGTGAAGAAGATAACGATAACCGTCGTGATGCATTGCTTGAACAGTTCTTTAAAGGATGGGGAGAAGAAGACGTTGTGCCTGGACAAGCGTTTCCACGCTCTCTTTTCCAAGGGTTGGGAGCTGCCCCGTTTCGCTCAACTCCAGAGTTATGGCGTCAGCATTGCCAAACATATTATGACAA
AGCGGAAGCCTGTCTGGCTAAACATATCAGTGATTGGCGCAAGCGAACTCGTCCCCGTCCGACATCGCGGGAGATGTGGTACAAAACACGTTCCTATCATGGCGGCAAGTCCATTTGGATGTTGGAATATCTTGATGCCGTTCGAAAACTGCTTCTCAGTTGGAGCTTACGTGGTCGTACTTACGGTGCCATTAATCGCCAGGATACAGCCCGGTTTGGTTCTTTGGCATCACGGCTGCTCCACCATATCAATTCCCTAAAGGAAGACCGCATCAAAACAGGAGCCGACTCTATCGTTCAGGCTGCTCGCGGGTATATTCCTCTCCCTCATGGCAAGGGTTGGGAACAAAGATATGAGCCTTGTCAGCTCATATTATTTGAAGACCTCGCACGATATCGCTTTCGCGTGGATCGACCTCGTCGAGAGAACAGCCAACTCATGCAGTGGAACCATCGAGCCATCGTGGCAGAAACAACGATGCAAGCCGAACTCTACGGACAAATTGTCGAAAATACTGCAGCGGGGTTCAGCAGTCGTTTTCACGCGGCGACAGGTGCCCCCGGTGTACGTTGTCGTTTTCTTCTAGAAAGAGACTTTGATAACGATTTGCCCAAACCGTACCTTCTCAGGGAACTTTCTTGGATGCTCGGCAATACAAAAGTCGAGTCTGAAGAAGAAAAGCTTCGATTGCTGTCTGAAAAAATCAGGCCAGGCAGTCTTGTTCCTTGGGATGGAGGCGAACAGTTCGCTACCCTGCATCCCAAAAGACAAACACTTTGCGTCATTCATGCCGATATGAATGCTGCCCAAAATTTACAACGCCGGTTTTTCGGTCGATGCGGCGAGGCCTTTCGGCTTGTTTGTCAACCCCACGGTGACGACGTGTTACGACTCGCATCCACCCCAGGAGCTCGTCTTCTTGGAGCCCTGCAGCAGCTTGAAAATGGACAAGGAGCTTTCGAGTTGGTTCGAGACATGGGGTCAACAAGTCAAATGAACCGGTTCGTCATGAAGTCTTTGGGAAAAAAGAAAATAAAACCCCTTCAGGACAACAATGGAGACGACGAGCTTGAAGACGTGTTGTCCGTACTCCCGGAGGAAGACGACACAGGACGTATCACAGTCTTCCGCGATTCATCAGGAATCTTTTTTCCTTGCAACGTCTGGATACCGGCCAAACAGTTTTGGCCAGCAGTACGCGCCATGATTTGGAAGGTCATGGCTTCCCATTCTTTGGGGTGA SEQATGACAAAGTTAAGACACCGACAGAAAAAATTAACACACGACTGGGCTGGCTCCAAAAAGAGGGAAGT IDATTAGGCTCAAATGGCAAGCTTCAGAATCCGTTGTTAATGCCGGTTAAAAAAGGTCAGGTTACTGAGTTNO:CCGGAAAGCGTTTTCTGCGTATGCTCGCGCAACGAAAGGAGAAATGACTGACGGCCGAAAGAATATGTT 
36TACGCATAGTTTCGAGCCATTTAAGACAAAGCCCTCGCTTCATCAGTGTGAATTGGCAGATAAAGCATATCAATCTTTACATTCGTATCTGCCTGGTTCTCTTGCTCATTTTCTATTATCTGCTCACGCATTAGGTTTTCGTATTTTTTCAAAATCTGGTGAAGCAACTGCATTCCAGGCATCCTCTAAAATTGAAGCTTACGAATCAAAATTGGCAAGCGAATTAGCTTGTGTAGATTTATCTATTCAAAACTTGACTATTTCAACGCTTTTTAATGCGCTTACAACGTCTGTAAGAGGGAAGGGCGAAGAAACTAGCGCTGACCCCTTAATTGCACGATTTTACACCTTACTTACTGGCAAGCCTCTGTCTCGAGACACTCAAGGGCCTGAACGTGATTTAGCAGAAGTTATCTCGCGTAAGATAGCTAGTTCTTTTGGCACATGGAAAGAAATGACGGCAAACCCTCTTCAGTCATTACAATTTTTTGAAGAGGAACTCCATGCGCTGGATGCCAATGTCTCGCTCTCACCCGCCTTCGACGTTTTAATTAAAATGAATGATTTGCAGGGCGATTTAAAAAATCGAACCATTGTTTTTGATCCTGACGCCCCTGTTTTTGAATATAACGCAGAAGACCCTGCCGACATAATTATTAAACTTACAGCTCGTTACGCTAAAGAAGCTGTCATCAAAAATCAAAACGTAGGAAATTACGTTAAAAACGCTATTACTACCACAAATGCCAATGGTCTTGGTTGGCTTTTGAACAAAGGTTTGTCGTTACTCCCTGTCTCGACCGATGACGAATTGCTAGAGTTTATTGGCGTTGAACGATCTCATCCCTCATGCCATGCCTTAATTGAATTGATTGCACAATTAGAAGCCCCCGAGCTCTTTGAGAAGAACGTATTTTCAGATACTCGTTCTGAAGTTCAAGGTATGATTGATTCAGCTGTTTCTAATCATATTGCTCGTCTTTCCAGCTCTAGAAATAGCTTGTCAATGGATAGTGAAGAATTAGAACGTTTAATCAAAAGCTTTCAGATACACACACCTCATTGCTCACTTTTTATTGGCGCCCAATCACTTTCACAGCAGTTAGAATCTTTGCCTGAAGCCCTTCAATCGGGCGTTAATTCAGCCGATATTTTACTAGGCTCTACTCAATATATGCTCACCAATTCTTTGGTTGAAGAGTCAATTGCAACTTATCAAAGAACACTTAATCGCATCAATTACTTGTCAGGTGTTGCAGGTCAGATTAACGGCGCAATAAAGCGAAAAGCGATAGATGGAGAAAAAATTCACTTGCCTGCAGCTTGGTCAGAGTTGATATCTTTACCATTTATAGGCCAGCCTGTTATAGATGTTGAAAGCGATTTAGCTCATCTAAAAAATCAATACCAAACACTTTCAAATGAGTTTGATACTCTTATATCTGCTTTGCAAAAGAATTTTGATTTGAACTTTAATAAAGCGCTCCTTAATCGTACTCAGCATTTTGAAGCCATGTGTAGAAGCACTAAGAAAAACGCTTTATCCAAACCAGAGATCGTTTCCTATCGCGACCTGCTTGCTCGATTAACTTCTTGTTTGTATCGAGGCTCTTTAGTTTTGCGTCGTGCCGGCATTGAAGTGTTAAAAAAACATAAAATATTTGAGTCAAACAGCGAACTTCGTGAACATGTTCATGAAAGAAAGCATTTCGTGTTTGTTAGTCCTCTAGATCGCAAAGCCAAGAAACTCCTTCGATTAACTGATTCGCGTCCAGACTTGTTACATGTTATTGATGAAATATTGCAGCACGATAATCTTGAAAACAAAGACCGCGAGTCACTTTGGCTAGTTCGCTCTGGTTATTTGCTTGCAGGACTTCCAGATCAACTTTCTTCATCTTTTATTAACTTGCCTATCATTACTCAAAAAGGAGATAGACGCCTTATAGACCTGATTCAGTATGATCAAATTAATCGTGATGCTTTTGTTATGTTAGT
GACCTCTGCATTCAAGTCTAATTTGTCTGGTCTGCAGTATCGTGCCAATAAGCAATCGTTCGTTGTTACTCGCACGCTAAGCCCTTATCTCGGCTCAAAACTTGTCTACGTACCCAAGGATAAAGATTGGTTAGTTCCTTCTCAAATGTTTGAAGGACGATTTGCTGACATTCTTCAATCAGATTATATGGTCTGGAAAGATGCCGGTCGTCTTTGTGTTATTGATACTGCAAAACACCTTTCTAATATAAAGAAGTCTGTATTTTCATCCGAAGAAGTTCTCGCTTTTTTAAGAGAACTCCCTCACCGCACATTTATCCAGACCGAAGTTCGCGGCCTTGGCGTTAATGTCGATGGAATTGCATTTAATAATGGTGATATTCCGTCATTAAAAACCTTTTCAAATTGCGTTCAGGTAAAAGTTTCTCGGACTAATACATCCCTAGTTCAAACACTTAATCGTTGGTTTGAAGGAGGAAAAGTTTCTCCTCCGAGCATTCAATTTGAACGGGCGTATTATAAAAAAGACGATCAAATTCATGAAGACGCAGCGAAAAGAAAGATACGATTCCAGATGCCCGCAACTGAGTTGGTTCATGCTTCTGACGATGCGGGGTGGACACCAAGTTATTTGCTCGGCATTGATCCTGGCGAGTATGGAATGGGTCTTTCATTGGTTTCGATTAATAACGGAGAAGTCTTAGATTCAGGCTTTATTCATATTAATTCTCTGATCAATTTTGCCTCTAAAAAGAGCAACCATCAAACTAAGGTTGTTCCGCGTCAGCAGTACAAATCTCCTTATGCAAATTATTTAGAACAATCTAAAGATTCTGCTGCTGGTGATATTGCGCATATACTCGATCGACTTATATACAAATTAAATGCGTTGCCTGTTTTTGAGGCTCTTTCAGGTAATTCTCAGAGTGCTGCTGATCAAGTTTGGACGAAAGTCTTATCGTTTTACACTTGGGGTGATAATGACGCTCAGAATTCTATTAGAAAGCAGCATTGGTTTGGAGCCAGTCATTGGGATATCAAAGGTATGTTAAGGCAACCCCCTACGGAGAAGAAGCCTAAACCGTATATTGCTTTTCCTGGCTCTCAGGTTTCTTCGTATGGTAATTCCCAACGTTGCTCTTGCTGCGGTCGCAATCCTATTGAACAACTTCGAGAAATGGCAAAGGATACCTCTATTAAAGAGCTAAAAATTCGCAATTCTGAGATACAGCTTTTTGACGGAACCATTAAATTATTTAATCCAGACCCATCCACTGTGATAGAGAGAAGGCGACATAATCTTGGTCCATCAAGAATTCCTGTTGCTGACCGTACTTTCAAAAACATCAGTCCATCAAGTCTAGAATTTAAAGAATTGATTACTATCGTGTCTCGATCTATCCGTCATTCACCTGAGTTTATCGCTAAAAAACGCGGCATAGGGTCTGAGTATTTTTGCGCTTATTCCGATTGCAACTCATCCTTAAATTCTGAAGCTAACGCAGCTGCTAACGTAGCGCAAAAATTTCAAAAACAGTTATTTTTTGAGTTATAA SEQATGAAGAGAATTCTGAACAGTCTGAAAGTTGCTGCCTTGAGACTTCTGTTTCGAGGCAAAGGTTCTGAATID TAGTGAAGACAGTCAAATATCCATTGGTTTCCCCGGTTCAAGGCGCGGTTGAAGAACTTGCTGAAGCAANO:TTCGGCACGACAACCTGCACCTTTTTGGGCAGAAGGAAATAGTGGATCTTATGGAGAAAGACGAAGGAA 
37CCCAGGTGTATTCGGTTGTGGATTTTTGGTTGGATACCCTGCGTTTAGGGATGTTTTTCTCACCATCAGCGAATGCGTTGAAAATCACGCTGGGAAAATTCAATTCTGATCAGGTTTCACCTTTTCGTAAGGTTTTGGAGCAGTCACCTTTTTTTCTTGCGGGTCGCTTGAAGGTTGAACCTGCGGAAAGGATACTTTCTGTTGAAATCAGAAAGATTGGTAAAAGAGAAAACAGAGTTGAGAACTATGCCGCCGATGTGGAGACATGCTTCATTGGTCAGCTTTCTTCAGATGAGAAACAGAGTATCCAGAAGCTGGCAAATGATATCTGGGATAGCAAGGATCATGAGGAACAGAGAATGTTGAAGGCGGATTTTTTTGCTATACCTCTTATAAAAGACCCCAAAGCTGTCACAGAAGAAGATCCTGAAAATGAAACGGCGGGAAAACAGAAACCGCTTGAATTATGTGTTTGTCTTGTTCCTGAGTTGTATACCCGAGGTTTCGGCTCCATTGCTGATTTTCTGGTTCAGCGACTTACCTTGCTGCGTGACAAAATGAGTACCGACACGGCGGAAGATTGCCTCGAGTATGTTGGCATTGAGGAAGAAAAAGGCAATGGAATGAATTCCTTGCTCGGCACTTTTTTGAAGAACCTGCAGGGTGATGGTTTTGAACAGATTTTTCAGTTTATGCTTGGGTCTTATGTTGGCTGGCAGGGGAAGGAAGATGTACTGCGCGAACGATTGGATTTGCTGGCCGAAAAAGTCAAAAGATTACCAAAGCCAAAATTTGCCGGAGAATGGAGTGGTCATCGTATGTTTCTCCATGGTCAGCTGAAAAGCTGGTCGTCGAATTTCTTCCGTCTTTTTAATGAGACGCGGGAACTTCTGGAAAGTATCAAGAGTGATATTCAACATGCCACCATGCTCATTAGCTATGTGGAAGAGAAAGGAGGCTATCATCCACAGCTGTTGAGTCAGTATCGGAAGTTAATGGAACAATTACCGGCGTTGCGGACTAAGGTTTTGGATCCTGAGATTGAGATGACGCATATGTCCGAGGCTGTTCGAAGTTACATTATGATACACAAGTCTGTAGCGGGATTTCTGCCGGATTTACTCGAGTCTTTGGATCGAGATAAGGATAGGGAATTTTTGCTTTCCATCTTTCCTCGTATTCCAAAGATAGATAAGAAGACGAAAGAGATCGTTGCATGGGAGCTACCGGGCGAGCCAGAGGAAGGCTATTTGTTCACAGCAAACAACCTTTTCCGGAATTTTCTTGAGAATCCGAAACATGTGCCACGATTTATGGCAGAGAGGATTCCCGAGGATTGGACGCGTTTGCGCTCGGCCCCTGTGTGGTTTGATGGGATGGTGAAGCAATGGCAGAAGGTGGTGAATCAGTTGGTTGAATCTCCAGGCGCCCTTTATCAGTTCAATGAAAGTTTTTTGCGTCAAAGACTGCAAGCAATGCTTACGGTCTATAAGCGGGATCTCCAGACTGAGAAGTTTCTGAAGCTGCTGGCTGATGTCTGTCGTCCACTCGTTGATTTTTTCGGACTTGGAGGAAATGATATTATCTTCAAGTCATGTCAGGATCCAAGAAAGCAATGGCAGACTGTTATTCCACTCAGTGTCCCAGCGGATGTTTATACAGCATGTGAAGGCTTGGCTATTCGTCTCCGCGAAACTCTTGGATTCGAATGGAAAAATCTGAAAGGACACGAGCGGGAAGATTTTTTACGGCTGCATCAGTTGCTGGGAAATCTGCTGTTCTGGATCAGGGATGCGAAACTTGTCGTGAAGCTGGAAGACTGGATGAACAATCCTTGTGTTCAGGAGTATGTGGAAGCACGAAAAGCCATTGATCTTCCCTTGGAGATTTTCGGATTTGAGGTGCCGATTTTTCTCAATGGCTATCTCTTTTCGGAACTGCGCCAGCTGGAATTGTTGCTGAGGCGTAAGTCGGTGATGACGTCTTACAGCGTCAAAA
CGACAGGCTCGCCAAATAGGCTCTTCCAGTTGGTTTACCTACCTCTAAACCCTTCAGATCCGGAAAAGAAAAATTCCAACAACTTTCAGGAGCGCCTCGATACACCTACCGGTTTGTCGCGTCGTTTTCTGGATCTTACGCTGGATGCATTTGCTGGCAAACTCTTGACGGATCCGGTAACTCAGGAACTGAAGACGATGGCCGGTTTTTACGATCATCTCTTTGGCTTCAAGTTGCCGTGTAAACTGGCGGCGATGAGTAACCATCCAGGATCCTCTTCCAAAATGGTGGTTCTGGCAAAACCAAAGAAGGGTGTTGCTAGTAACATCGGCTTTGAACCTATTCCCGATCCTGCTCATCCTGTGTTCCGGGTGAGAAGTTCCTGGCCGGAGTTGAAGTACCTGGAGGGGTTGTTGTATCTTCCCGAAGATACACCACTGACCATTGAACTGGCGGAAACGTCGGTCAGTTGTCAGTCTGTGAGTTCAGTCGCTTTCGATTTGAAGAATCTGACGACTATCTTGGGTCGTGTTGGTGAATTCAGGGTGACGGCAGATCAACCTTTCAAGCTGACGCCCATTATTCCTGAGAAAGAGGAATCCTTCATCGGGAAGACCTACCTCGGTCTTGATGCTGGAGAGCGATCTGGCGTTGGTTTCGCGATTGTGACGGTTGACGGCGATGGGTATGAGGTGCAGAGGTTGGGTGTGCATGAAGATACTCAGCTTATGGCGCTTCAGCAAGTCGCCAGCAAGTCTCTTAAGGAGCCGGTTTTCCAGCCACTCCGTAAGGGCACATTTCGTCAGCAGGAGCGCATTCGCAAAAGCCTCCGCGGTTGCTACTGGAATTTCTATCATGCATTGATGATCAAGTACCGAGCTAAAGTTGTGCATGAGGAATCGGTGGGTTCATCCGGTCTGGTGGGGCAGTGGCTGCGTGCATTTCAGAAGGATCTCAAAAAGGCTGATGTTCTGCCCAAGAAGGGTGGAAAAAATGGTGTAGACAAAAAAAAGAGAGAAAGCAGCGCTCAGGATACCTTATGGGGAGGAGCTTTCTCGAAGAAGGAAGAGCAGCAGATAGCCTTTGAGGTTCAGGCAGCTGGATCAAGCCAGTTTTGTCTGAAGTGTGGTTGGTGGTTTCAGTTGGGGATGCGGGAAGTAAATCGTGTGCAGGAGAGTGGCGTGGTGCTGGACTGGAACCGGTCCATTGTAACCTTCCTCATCGAATCCTCAGGAGAAAAGGTATATGGTTTCAGTCCTCAGCAACTGGAAAAAGGCTTTCGTCCTGACATCGAAACGTTCAAAAAAATGGTAAGGGATTTTATGAGACCCCCCATGTTTGATCGCAAAGGTCGGCCGGCCGCGGCGTATGAAAGATTCGTACTGGGACGTCGTCACCGTCGTTATCGCTTTGATAAAGTTTTTGAAGAGAGATTTGGTCGCAGTGCTCTTTTCATCTGCCCGCGGGTCGGGTGTGGGAATTTCGATCACTCCAGTGAGCAGTCAGCCGTTGTCCTTGCCCTTATTGGTTACATTGCTGATAAGGAAGGGATGAGTGGTAAGAAGCTTGTTTATGTGAGGCTGGCTGAACTTATGGCTGAGTGGAAGCTGAAGAAACTGGAGAGATCAAGGGTGGAAGAACAGAGCTCGGCACAATAA SEQATGGCAGAAAGCAAGCAGATGCAATGCCGCAAGTGCGGCGCAAGCATGAAGTATGAAGTAATTGGATT IDGGGCAAGAAGTCATGCAGATATATGTGCCCAGATTGCGGCAATCACACCAGCGCGCGCAAGATTCAGA NO:ACAAGAAAAAGCGCGACAAAAAGTATGGATCCGCAAGCAAAGCGCAGAGCCAGAGGATAGCTGTGGCT 
38GGCGCGCTTTATCCAGACAAAAAAGTGCAGACCATAAAGACCTACAAATACCCAGCGGATCTTAATGGCGAAGTTCATGACAGCGGCGTCGCAGAGAAGATTGCGCAGGCGATTCAGGAAGATGAGATCGGCCTGCTTGGCCCGTCCAGCGAATACGCTTGCTGGATTGCTTCACAAAAACAGAGCGAGCCGTATTCAGTTGTAGATTTTTGGTTTGACGCGGTGTGCGCAGGCGGAGTATTCGCGTATTCTGGCGCGCGCCTGCTTTCCACAGTCCTCCAGTTGAGTGGCGAGGAAAGCGTTTTGCGCGCTGCTTTAGCATCTAGCCCGTTTGTAGATGACATTAATTTGGCGCAAGCGGAAAAGTTCCTAGCCGTTAGCCGGCGCACAGGCCAAGATAAGCTAGGCAAGCGCATTGGAGAATGTTTTGCGGAAGGCCGGCTTGAAGCGCTTGGCATCAAAGATCGCATGCGCGAATTCGTGCAAGCGATTGATGTGGCCCAAACCGCGGGCCAGCGGTTCGCGGCCAAGCTAAAGATATTCGGCATCAGTCAGATGCCTGAAGCCAAGCAATGGAACAATGATTCCGGGCTCACTGTATGTATTTTGCCGGATTATTATGTCCCGGAAGAAAACCGCGCGGACCAGCTGGTTGTTTTGCTTCGGCGCTTACGCGAGATCGCGTATTGCATGGGAATTGAGGATGAAGCAGGATTTGAGCATCTAGGCATTGACCCTGGTGCTCTTTCCAATTTTTCCAATGGCAATCCAAAGCGAGGATTTCTCGGCCGCCTGCTCAATAATGACATTATAGCGCTGGCAAACAACATGTCAGCCATGACGCCGTATTGGGAAGGCAGAAAAGGCGAGTTGATTGAGCGCCTTGCATGGCTTAAACATCGCGCTGAAGGATTGTATTTGAAAGAGCCACATTTCGGCAACTCCTGGGCAGACCACCGCAGCAGGATTTTCAGTCGCATTGCGGGCTGGCTTTCCGGATGCGCGGGCAAGCTCAAGATTGCCAAGGATCAGATTTCAGGCGTGCGTACGGATTTGTTTCTGCTCAAGCGCCTTCTGGATGCGGTACCGCAAAGCGCGCCGTCGCCGGACTTTATTGCTTCCATCAGCGCGCTGGATCGGTTTTTGGAAGCGGCAGAAAGCAGCCAGGATCCGGCAGAACAGGTACGCGCTTTGTACGCGTTTCATCTGAACGCGCCTGCGGTCCGATCCATCGCCAACAAGGCGGTACAGAGGTCTGATTCCCAGGAGTGGCTTATCAAGGAACTGGATGCTGTAGATCACCTTGAATTCAACAAAGCATTTCCGTTTTTTTCGGATACAGGAAAGAAAAAGAAGAAAGGAGCGAATAGCAACGGAGCGCCTTCTGAAGAAGAATACACGGAAACAGAATCCATTCAACAACCAGAAGATGCAGAGCAGGAAGTGAATGGTCAAGAAGGAAATGGCGCTTCAAAGAACCAGAAAAAGTTTCAGCGCATTCCTCGATTTTTCGGGGAAGGGTCAAGGAGTGAGTATCGAATTTTAACAGAAGCGCCGCAATATTTTGACATGTTCTGCAATAATATGCGCGCGATCTTTATGCAGCTAGAGAGTCAGCCGCGCAAGGCGCCTCGTGATTTCAAATGCTTTCTGCAGAATCGTTTGCAGAAGCTTTACAAGCAAACCTTTCTCAATGCTCGCAGTAATAAATGCCGCGCGCTTCTGGAATCCGTCCTTATTTCATGGGGAGAATTTTATACTTATGGCGCGAATGAAAAGAAGTTTCGTCTGCGCCATGAAGCGAGCGAGCGCAGCTCGGATCCGGACTATGTGGTTCAGCAGGCATTGGAAATCGCGCGCCGGCTTTTCTTGTTCGGATTTGAGTGGCGCGATTGCTCTGCTGGAGAGCGCGTGGATTTGGTTGAAATCCACAAAAAAGCAATCTCATTTTTGCTTGCAATCACTCAGGCCGAGGTTTCAGTTGGTTCCTATAACTGG
CTTGGGAATAGCACCGTGAGCCGGTATCTTTCGGTTGCTGGCACAGACACATTGTACGGCACTCAACTGGAGGAGTTTTTGAACGCCACAGTGCTTTCACAGATGCGTGGGCTGGCGATTCGGCTTTCATCTCAGGAGTTAAAAGACGGATTTGATGTTCAGTTGGAGAGTTCGTGCCAGGACAATCTCCAGCATCTGCTGGTGTATCGCGCTTCGCGCGACTTGGCTGCGTGCAAACGCGCTACATGCCCGGCTGAATTGGATCCGAAAATTCTTGTTCTGCCGGTTGGTGCGTTTATCGCGAGCGTAATGAAAATGATTGAGCGTGGCGATGAACCATTAGCAGGCGCGTATTTGCGTCATCGGCCGCATTCATTCGGCTGGCAGATACGGGTTCGTGGAGTGGCGGAAGTAGGCATGGATCAGGGCACAGCGCTAGCATTCCAGAAGCCGACTGAATCAGAGCCGTTTAAAATAAAGCCGTTTTCCGCTCAATACGGCCCAGTACTTTGGCTTAATTCTTCATCCTATAGCCAGAGCCAGTATCTGGATGGATTTTTAAGCCAGCCAAAGAATTGGTCTATGCGGGTGCTACCTCAAGCCGGATCAGTGCGCGTGGAACAGCGCGTTGCTCTGATATGGAATTTGCAGGCAGGCAAGATGCGGCTGGAGCGCTCTGGAGCGCGCGCGTTTTTCATGCCAGTGCCATTCAGCTTCAGGCCGTCTGGTTCAGGAGATGAAGCAGTATTGGCGCCGAATCGGTACTTGGGACTTTTTCCGCATTCCGGAGGAATAGAATACGCGGTGGTGGATGTATTAGATTCCGCGGGTTTCAAAATTCTTGAGCGCGGTACGATTGCGGTAAATGGCTTTTCCCAGAAGCGCGGCGAACGCCAAGAGGAGGCACACAGAGAAAAACAGAGACGCGGAATTTCTGATATAGGCCGCAAGAAGCCGGTGCAAGCTGAAGTTGACGCAGCCAATGAATTGCACCGCAAATACACCGATGTTGCCACTCGTTTAGGGTGCAGAATTGTGGTTCAGTGGGCGCCCCAGCCAAAGCCGGGCACAGCGCCGACCGCGCAAACAGTATACGCGCGCGCAGTGCGGACCGAAGCGCCGCGATCTGGAAATCAAGAGGATCATGCTCGTATGAAATCCTCTTGGGGATATACCTGGGGCACCTATTGGGAGAAGCGCAAACCAGAGGATATTTTGGGCATCTCAACCCAAGTATACTGGACCGGCGGTATAGGCGAGTCATGTCCCGCAGTCGCGGTTGCGCTTTTGGGGCACATTAGGGCAACATCCACTCAAACTGAATGGGAAAAAGAGGAGGTTGTATTCGGTCGACTGAAGAAGTTCTTTCCAAGCTAG SEQATGGAAAAGAGAATAAACAAGATACGAAAGAAACTATCGGCCGATAATGCCACAAAGCCTGTGAGCAG IDGAGCGGCCCCATGAAAACACTCCTTGTCCGGGTCATGACGGACGACTTGAAAAAAAGACTGGAGAAGC NO:GTCGGAAAAAGCCGGAAGTTATGCCGCAGGTTATTTCAAATAACGCAGCAAACAATCTTAGAATGCTCC 
39TTGATGACTATACAAAGATGAAGGAGGCGATACTACAAGTTTACTGGCAGGAATTTAAGGACGACCATGTGGGCTTGATGTGCAAATTTGCCCAGCCTGCTTCCAAAAAAATTGACCAGAACAAACTAAAACCGGAAATGGATGAAAAAGGAAATCTAACAACTGCCGGTTTTGCATGTTCTCAATGCGGTCAGCCGCTATTTGTTTATAAGCTTGAACAGGTGAGTGAAAAAGGCAAGGCTTATACAAATTACTTCGGCCGGTGTAATGTGGCCGAGCATGAGAAATTGATTCTTCTTGCTCAATTAAAACCTGAAAAAGACAGTGACGAAGCAGTGACATACTCCCTTGGCAAATTCGGCCAGAGGGCATTGGACTTTTATTCAATCCACGTAACAAAAGAATCCACCCATCCAGTAAAGCCCCTGGCACAGATTGCGGGCAACCGCTATGCAAGCGGACCTGTTGGCAAGGCCCTTTCCGATGCCTGTATGGGCACTATAGCCAGTTTTCTTTCGAAATATCAAGACATCATCATAGAACATCAAAAGGTTGTGAAGGGTAATCAAAAGAGGTTAGAGAGTCTCAGGGAATTGGCAGGGAAAGAAAATCTTGAGTACCCATCGGTTACACTGCCGCCGCAGCCGCATACGAAAGAAGGGGTTGACGCTTATAACGAAGTTATTGCAAGGGTACGTATGTGGGTTAATCTTAATCTGTGGCAAAAGCTGAAGCTCAGCCGTGATGACGCAAAACCGCTACTGCGGCTAAAAGGATTCCCATCTTTCCCTGTTGTGGAGCGGCGTGAAAACGAAGTTGACTGGTGGAATACGATTAATGAAGTAAAAAAACTGATTGACGCTAAACGAGATATGGGACGGGTATTCTGGAGCGGCGTTACCGCAGAAAAGAGAAATACCATCCTTGAAGGATACAACTATCTGCCAAATGAGAATGACCATAAAAAGAGAGAGGGCAGTTTGGAAAACCCTAAGAAGCCTGCCAAACGCCAGTTTGGAGACCTCTTGCTGTATCTTGAAAAGAAATATGCCGGAGACTGGGGAAAGGTCTTCGATGAGGCATGGGAGAGGATAGATAAGAAAATAGCCGGACTCACAAGCCATATAGAGCGCGAAGAAGCAAGAAACGCGGAAGACGCTCAATCCAAAGCCGTACTTACAGACTGGCTAAGGGCAAAGGCATCATTTGTTCTTGAAAGACTGAAGGAAATGGATGAAAAGGAATTCTATGCGTGTGAAATCCAACTTCAAAAATGGTATGGCGATCTTCGAGGCAACCCGTTTGCCGTTGAAGCTGAGAATAGAGTTGTTGATATAAGCGGGTTTTCTATCGGAAGCGATGGCCATTCAATCCAATACAGAAATCTCCTTGCCTGGAAATATCTGGAGAACGGCAAGCGTGAATTCTATCTGTTAATGAATTATGGCAAGAAAGGGCGCATCAGATTTACAGATGGAACAGATATTAAAAAGAGCGGCAAATGGCAGGGACTATTATATGGCGGTGGCAAGGCAAAGGTTATTGATCTGACTTTCGACCCCGATGATGAACAGTTGATAATCCTGCCGCTGGCCTTTGGCACAAGGCAAGGCCGCGAGTTTATCTGGAACGATTTGCTGAGTCTTGAAACAGGCCTGATAAAGCTCGCAAACGGAAGAGTTATCGAAAAAACAATCTATAACAAAAAAATAGGGCGGGATGAACCGGCTCTATTCGTTGCCTTAACATTTGAGCGCCGGGAAGTTGTTGATCCATCAAATATAAAGCCTGTAAACCTTATAGGCGTTGACCGCGGCGAAAACATCCCGGCGGTTATTGCATTGACAGACCCTGAAGGTTGTCCTTTACCGGAATTCAAGGATTCATCAGGGGGCCCAACAGACATCCTGCGAATAGGAGAAGGATATAAGGAAAAGCAGAGGGCTATTCAGGCAGCAAAGGAGGTAGAGCAAAGGCGGGCTGGCGGTTATTCACGGA
AGTTTGCATCCAAGTCGAGGAACCTGGCGGACGACATGGTGAGAAATTCAGCGCGAGACCTTTTTTACCATGCCGTTACCCACGATGCCGTCCTTGTCTTTGAAAACCTGAGCAGGGGTTTTGGAAGGCAGGGCAAAAGGACCTTCATGACGGAAAGACAATATACAAAGATGGAAGACTGGCTGACAGCGAAGCTCGCATACGAAGGTCTTACGTCAAAAACCTACCTTTCAAAGACGCTGGCGCAATATACGTCAAAAACATGCTCCAACTGCGGGTTTACTATAACGACTGCCGATTATGACGGGATGTTGGTAAGGCTTAAAAAGACTTCTGATGGATGGGCAACTACCCTCAACAACAAAGAATTAAAAGCCGAAGGCCAGATAACGTATTATAACCGGTATAAAAGGCAAACCGTGGAAAAAGAACTCTCCGCAGAGCTTGACAGGCTTTCAGAAGAGTCGGGCAATAATGATATTTCTAAGTGGACCAAGGGTCGCCGGGACGAGGCATTATTTTTGTTAAAGAAAAGATTCAGCCATCGGCCTGTTCAGGAACAGTTTGTTTGCCTCGATTGCGGCCATGAAGTCCACGCCGATGAACAGGCAGCCTTGAATATTGCAAGGTCATGGCTTTTTCTAAACTCAAATTCAACAGAATTCAAAAGTTATAAATCGGGTAAACAGCCCTTCGTTGGTGCTTGGCAGGCCTTTTACAAAAGGAGGCTTAAAGAGGTATGGAAGCCCAACGCC SEQATGAAAAGGATAAATAAAATACGAAGGAGATTGGTAAAGGATAGCAACACGAAAAAAGCCGGCAAAA IDCCGGCCCTATGAAAACCTTGCTCGTTCGGGTTATGACACCTGACCTGAGAGAAAGGTTAGAGAATCTTCNO:GCAAAAAGCCGGAAAACATTCCTCAGCCCATTTCAAATACTTCACGTGCAAATTTAAATAAACTCCTCA 40CTGACTATACGGAAATGAAGAAAGCAATCCTGCATGTTTATTGGGAAGAGTTCCAAAAAGACCCTGTCGGATTGATGAGCAGGGTTGCACAACCAGCGCCCAAGAATATTGATCAGAGAAAATTGATTCCGGTGAAGGACGGAAATGAGAGACTAACAAGTTCTGGATTTGCCTGTTCTCAGTGCTGTCAACCCCTCTATGTTTATAAGCTTGAACAAGTGAATGACAAGGGTAAGCCCCATACAAATTACTTTGGCCGTTGTAATGTCTCCGAGCATGAACGTTTGATATTGCTCTCGCCGCATAAACCGGAGGCAAATGACGAGCTAGTAACGTATTCGTTGGGGAAGTTCGGTCAAAGGGCATTGGACTTTTATTCAATCCACGTAACAAGAGAATCGAACCATCCTGTAAAGCCGCTAGAACAGATCGGTGGCAATAGCTGCGCAAGTGGTCCCGTTGGTAAGGCTTTATCTGATGCCTGTATGGGAGCAGTAGCCAGTTTCCTTACAAAGTACCAGGACATCATCCTCGAACACCAAAAGGTTATAAAAAAAAACGAAAAGAGATTGGCAAATCTAAAGGATATAGCAAGTGCAAACGGGCTTGCATTTCCTAAAATCACTCTTCCACCGCAACCGCATACAAAAGAAGGGATTGAAGCTTATAACAATGTTGTTGCTCAGATAGTGATCTGGGTAAACCTGAATCTTTGGCAGAAACTCAAAATTGGCAGGGATGAGGCAAAGCCCTTACAGCGGCTTAAGGGTTTTCCGTCCTTCCCTCTTGTTGAACGCCAGGCGAATGAGGTTGATTGGTGGGATATGGTCTGTAATGTCAAAAAGTTGATTAACGAAAAGAAAGAGGACGGGAAGGTCTTCTGGCAAAATCTTGCTGGATATAAAAGGCAGGAAGCCTTGCTTCCATATCTTTCGTCTGAAGAAGACCGTAAAAAAGGAAAAAAGTTTGCGCGTTATCAGTTTGGTGACCTTTTGCTTCACCTTGAAAAGAAACACGGTGAAGATTG
GGGCAAAGTTTATGATGAGGCATGGGAAAGAATAGATAAAAAAGTTGAAGGTCTGAGTAAGCACATAAAGTTGGAGGAAGAAAGAAGGTCTGAAGATGCTCAATCAAAGGCTGCCCTCACTGATTGGCTCAGGGCAAAGGCCTCTTTTGTTATTGAAGGGCTCAAAGAAGCTGATAAGGATGAGTTTTGCAGGTGTGAGTTAAAGCTTCAAAAGTGGTATGGAGATTTGAGAGGAAAACCATTTGCTATAGAAGCAGAGAACAGCATTTTAGATATAAGCGGATTTTCTAAACAGTATAATTGTGCATTTATATGGCAGAAAGACGGCGTAAAGAAGTTAAATCTTTATTTAATAATAAATTACTTCAAAGGTGGTAAGCTACGCTTCAAAAAAATCAAGCCAGAAGCTTTTGAAGCAAATAGGTTTTATACAGTAATTAATAAAAAAAGCGGTGAGATTGTGCCTATGGAGGTCAACTTCAATTTTGATGACCCGAATTTGATAATTCTGCCTTTGGCCTTTGGAAAAAGGCAGGGGAGGGAGTTTATCTGGAACGACCTATTGAGCCTTGAGACGGGTTCATTGAAACTCGCCAATGGCAGGGTTATTGAAAAAACGCTCTATAACAGAAGGACGAGACAGGATGAACCAGCACTTTTTGTTGCCCTGACATTTGAAAGAAGAGAGGTGCTTGACTCATCGAATATAAAACCGATGAATCTGATAGGAATAGACCGGGGAGAAAATATCCCGGCAGTCATAGCATTAACAGACCCGGAAGGATGCCCCTTGTCAAGATTCAAAGATTCATTGGGCAATCCAACGCATATTTTGCGAATAGGAGAAAGTTATAAGGAAAAACAACGGACTATTCAGGCTGCTAAAGAAGTTGAACAAAGGCGGGCAGGCGGATATTCGAGAAAATATGCATCAAAGGCGAAGAATCTGGCGGACGATATGGTAAGAAATACAGCTCGTGACCTCTTATATTATGCTGTTACTCAAGATGCAATGCTCATTTTTGAAAATCTTTCCCGCGGTTTTGGTAGACAAGGCAAGAGGACTTTTATGGCGGAAAGGCAGTACACGAGGATGGAAGACTGGCTGACTGCAAAGCTTGCCTATGAAGGTCTGCCATCAAAAACCTATCTTTCAAAGACTCTGGCACAGTATACCTCAAAGACATGTTCTAATTGTGGTTTTACAATCACAAGTGCAGATTATGACAGGGTGCTCGAAAAGCTCAAGAAGACGGCTACTGGATGGATGACTACAATCAATGGAAAAGAGTTAAAAGTTGAAGGACAGATAACATACTATAACCGGTATAAAAGGCAGAATGTGGTAAAAGACCTCTCTGTAGAGCTGGATAGACTTTCGGAAGAGTCGGTAAATAATGATATTTCTAGTTGGACAAAAGGCCGCAGTGGTGAAGCTTTATCTCTGCTAAAAAAGAGATTTAGTCACAGGCCGGTGCAGGAAAAGTTTGTTTGCCTGAACTGTGGTTTTGAAACCCATGCAGACGAACAAGCAGCACTGAATATTGCAAGGTCGTGGCTCTTTCTCCGTTCTCAAGAATATAAGAAGTATCAAACCAATAAAACGACCGGAAATACTGACAAAAGGGCATTTGTTGAAACATGGCAATCCTTTTACAGAAAGAAGCTCAAAGAAGTATGGAAACCA SEQATGGGTAAAATGTATTACCTTGGTTTAGACATTGGCACGAATTCCGTGGGCTACGCGGTGACCGACCCCTID CATACCACCTGCTGAAGTTTAAGGGGGAACCAATGTGGGGTGCGCACGTATTTGCCGCCGGTAATCAGANO:GCGCGGAACGACGCTCGTTCCGCACATCGCGTCGTCGTTTGGACCGACGCCAACAGCGCGTTAAACTGG 
41TACAGGAGATTTTTGCCCCGGTGATTAGTCCGATCGACCCACGCTTCTTCATTCGTCTGCATGAATCCGCCCTGTGGCGCGATGACGTCGCGGAGACGGATAAACATATCTTTTTCAATGATCCTACCTATACCGATAAGGAATATTATAGCGATTACCCGACTATCCATCACCTGATCGTTGATCTGATGGAAAGCTCTGAGAAACACGATCCGCGGCTGGTGTACCTTGCAGTGGCGTGGTTAGTGGCACACCGTGGTCATTTTCTGAACGAGGTGGACAAGGATAATATTGGAGATGTGTTGTCGTTCGACGCATTTTATCCGGAGTTTCTCGCGTTCCTGTCGGACAACGGTGTATCACCGTGGGTGTGCGAAAGCAAAGCGCTGCAGGCGACCTTGCTGAGCCGTAACTCAGTGAACGACAAATATAAAGCCCTTAAGTCTCTGATCTTCGGATCCCAGAAACCTGAAGATAACTTCGATGCCAATATTTCGGAAGATGGACTCATTCAACTGCTGGCCGGCAAAAAGGTAAAAGTTAACAAACTGTTCCCTCAGGAATCGAACGATGCATCCTTCACATTGAATGATAAAGAAGACGCGATAGAAGAAATCCTGGGTACGCTTACACCAGATGAATGTGAATGGATTGCGCATATACGCCGCCTTTTTGACTGGGCTATCATGAAACATGCTCTGAAAGATGGCAGGACTATTAGCGAGTCAAAAGTCAAACTGTATGAGCAGCACCATCACGATCTGACCCAACTTAAATACTTCGTGAAAACCTACCTTGCAAAAGAATACGACGATATTTTCCGCAACGTGGATAGCGAAACAACGAAAAACTATGTAGCGTATTCCTATCATGTGAAAGAGGTGAAAGGCACTCTGCCTAAAAATAAGGCAACGCAAGAAGAGTTTTGTAAGTATGTCCTGGGCAAGGTTAAAAACATTGAATGCTCTGAAGCAGACAAGGTTGACTTTGATGAGATGATTCAGCGTCTTACCGACAACTCTTTTATGCCTAAGCAGGTTTCGGGCGAAAACCGCGTTATTCCTTATCAGTTATATTATTATGAACTGAAGACAATTCTGAATAAAGCAGCCTCGTACCTGCCTTTCCTGACGCAGTGTGGAAAAGATGCAATTTCGAACCAGGACAAACTACTGTCGATCATGACGTTCCGTATTCCTTACTTCGTCGGACCCTTGCGAAAAGATAATTCGGAACATGCATGGCTCGAACGAAAGGCCGGTAAGATTTATCCGTGGAACTTTAACGACAAAGTGGACTTGGATAAATCAGAAGAAGCGTTCATTCGCCGAATGACCAATACCTGTACCTATTATCCCGGCGAAGATGTTTTACCGTTGGATTCGCTGATCTATGAGAAATTTATGATTTTAAATGAAATCAATAATATTCGTATTGACGGCTACCCGATTAGTGTTGACGTTAAACAGCAGGTTTTTGGCTTGTTCGAAAAAAAACGACGCGTAACCGTGAAAGATATTCAGAACCTGCTGCTGTCTCTCGGAGCTCTGGACAAACACGGGAAGCTGACAGGCATCGATACCACTATCCACTCAAACTATAATACGTATCACCATTTTAAATCTCTCATGGAACGCGGCGTCCTGACCCGGGATGACGTGGAACGCATCGTTGAAAGGATGACCTACAGCGACGATACTAAGCGTGTGCGTCTGTGGCTGAATAACAACTATGGTACTTTAACCGCCGACGATGTGAAACACATTTCGCGTCTGCGCAAACACGATTTTGGCCGTTTATCCAAAATGTTCTTAACAGGTCTGAAGGGTGTCCATAAGGAGACCGGTGAACGTGCCTCCATACTGGATTTCATGTGGAACACGAACGATAACCTGATGCAGCTCCTTTCCGAATGCTACACGTTCAGTGATGAAATCACAAAGCTGCAAGAGGCGTATTATGCAAAAGCCCAGTTGTCTTTAAACGATTTTTTAGACT
CGATGTACATCTCTAACGCGGTGAAACGTCCGATTTACAGAACTCTGGCAGTGGTGAACGATATTCGAAAAGCATGTGGGACGGCCCCTAAACGCATTTTCATCGAAATGGCTCGTGATGGTGAATCAAAAAAAAAGAGAAGTGTTACACGTCGCGAGCAGATCAAAAACCTGTACCGCTCGATTCGTAAAGATTTCCAGCAGGAAGTTGATTTTCTGGAAAAGATCCTGGAAAATAAATCTGATGGTCAACTTCAGTCAGATGCTTTGTATCTTTACTTTGCACAATTAGGGCGCGATATGTACACGGGCGATCCAATAAAGCTGGAGCACATCAAAGATCAGAGTTTCTATAACATAGACCATATTTACCCGCAGTCTATGGTGAAAGACGATTCCCTAGATAACAAAGTGCTGGTGCAAAGCGAAATTAACGGCGAGAAAAGCTCGCGATACCCTTTGGACGCCGCGATCCGCAATAAAATGAAGCCCCTTTGGGACGCTTACTATAATCATGGCCTGATCTCCTTAAAGAAATACCAGCGTCTAACGCGCTCGACCCCGTTTACCGATGATGAAAAATGGGACTTTATTAATCGCCAGTTAGTGGAAACCCGTCAATCTACCAAAGCGCTGGCCATTTTGTTGAAGCGTAAGTTTCCAGACACCGAAATTGTGTATTCGAAGGCGGGGTTATCGTCCGACTTCAGACATGAATTCGGCCTTGTAAAAAGTCGCAATATTAATGATTTGCACCACGCTAAAGACGCATTCTTGGCTATCGTTACCGGCAATGTGTACCATGAAAGATTCAATCGCAGATGGTTTATGGTGAACCAGCCGTACTCAGTTAAAACTAAAACTCTTTTTACCCACAGCATAAAGAATGGCAACTTCGTTGCCTGGAACGGCGAAGAAGATCTCGGTCGTATTGTAAAAATGCTGAAGCAAAACAAAAATACCATTCACTTCACGCGCTTCTCCTTCGATCGCAAAGAAGGATTATTTGATATCCAACCTCTGAAAGCCAGCACCGGCTTAGTCCCACGAAAAGCCGGTCTGGATGTCGTTAAATACGGCGGATATGACAAATCTACCGCGGCCTATTACCTGCTGGTGAGGTTCACGCTCGAGGACAAGAAAACCCAGCACAAGCTGATGATGATTCCTGTAGAAGGCCTGTACAAGGCTCGCATTGATCATGACAAGGAATTTCTTACCGATTATGCGCAAACGACTATAAGCGAAATCCTACAGAAAGATAAACAGAAAGTGATCAATATTATGTTTCCAATGGGTACGAGGCATATAAAACTCAATTCAATGATTAGTATCGATGGCTTCTATCTTAGTATCGGCGGAAAGTCCTCTAAAGGTAAGTCAGTTCTATGTCACGCAATGGTTCCACTGATCGTCCCTCACAAAATCGAATGTTACATTAAAGCAATGGAAAGCTTCGCCCGGAAGTTTAAAGAAAACAACAAGCTGCGCATCGTAGAAAAATTCGATAAAATCACCGTTGAAGACAACCTGAATCTCTACGAGCTCTTTCTCCAAAAACTGCAGCATAATCCCTATAATAAGTTTTTTTCGACACAGTTTGACGTACTGACGAACGGCCGTTCTACTTTCACAAAACTGTCGCCGGAGGAACAGGTACAGACGCTCTTGAACATTTTAAGTATCTTTAAAACATGCCGCAGTTCGGGTTGCGACCTGAAATCCATCAACGGCAGTGCCCAGGCAGCGCGCATCATGATTAGCGCTGACTTAACTGGACTGTCGAAAAAATATTCAGATATTAGGTTGGTTGAACAGTCAGCTTCTGGTTTGTTCGTATCCAAAAGTCAGAACTTACTGGAGTATCTCTAA SEQATGTCATCGCTCACGAAATTCACTAACAAATACTCTAAACAGCTCACCATTAAGAATGAACTCATCCCA 
IDGTTGGCAAAACACTGGAGAACATCAAAGAGAATGGTCTGATAGATGGCGACGAACAGCTGAATGAGAA NO:TTATCAGAAGGCGAAAATTATTGTGGATGATTTTCTGCGGGACTTCATTAATAAAGCACTGAATAATACG42 CAGATCGGGAACTGGCGCGAACTGGCGGATGCCCTTAATAAAGAGGATGAAGATAACATCGAGAAATTGCAGGATAAAATTCGGGGAATCATTGTATCCAAATTTGAAACGTTTGATCTGTTTAGCAGCTATTCTATTAAGAAAGATGAAAAGATTATTGACGACGACAATGATGTTGAAGAAGAGGAACTGGATCTGGGCAAGAAGACCAGCTCATTTAAATACATATTTAAAAAAAACCTGTTTAAGTTAGTGTTGCCATCCTACCTGAAAACCACAAACCAGGACAAGCTGAAGATTATTAGCTCGTTTGATAATTTTTCAACGTACTTCCGCGGGTTCTTTGAAAACCGGAAAAACATTTTTACCAAGAAACCGATCTCCACAAGTATTGCGTATCGCATTGTTCATGATAACTTCCCGAAATTCCTTGATAACATTCGTTGTTTTAATGTGTGGCAGACGGAATGCCCGCAACTAATCGTGAAAGCAGATAACTATCTGAAAAGCAAAAATGTTATAGCGAAAGATAAAAGTTTGGCAAACTATTTTACCGTGGGCGCGTATGACTATTTCCTGTCTCAGAATGGTATAGATTTTTACAACAATATTATAGGTGGACTGCCAGCGTTCGCCGGCCATGAGAAAATCCAAGGTCTCAATGAATTCATCAATCAAGAGTGCCAAAAAGACAGCGAGCTGAAAAGTAAGCTGAAAAACCGTCACGCGTTCAAAATGGCGGTACTGTTCAAACAGATACTCAGCGATCGTGAAAAAAGTTTTGTAATTGATGAGTTCGAGTCGGATGCTCAAGTTATTGACGCCGTTAAAAACTTTTACGCCGAACAGTGCAAAGATAACAATGTTATTTTTAACTTATTAAATCTTATCAAGAATATCGCTTTCTTAAGTGATGACGAACTGGACGGCATATTCATTGAAGGGAAATACCTGTCGAGCGTTAGTCAAAAACTCTATAGCGATTGGTCAAAATTACGTAACGACATTGAGGATTCGGCTAACTCTAAACAAGGCAATAAAGAGCTGGCCAAGAAGATCAAAACCAACAAAGGGGATGTAGAAAAAGCGATCTCGAAATATGAGTTCTCGCTGTCGGAACTGAACTCGATTGTACATGATAACACCAAGTTTTCTGACCTCCTTAGTTGTACACTGCATAAGGTGGCTTCTGAGAAACTGGTGAAGGTCAATGAAGGCGACTGGCCGAAACATCTCAAGAATAATGAAGAGAAACAAAAAATCAAAGAGCCGCTTGATGCTCTGCTGGAGATCTATAATACACTTCTGATTTTTAACTGCAAAAGCTTCAATAAAAACGGCAACTTCTATGTCGACTATGATCGTTGCATCAATGAACTGAGTTCGGTCGTGTATCTGTATAATAAAACACGTAACTATTGCACTAAAAAACCCTATAACACGGACAAGTTCAAACTCAATTTTAACAGTCCGCAGCTCGGTGAAGGCTTTTCCAAGTCGAAAGAAAATGACTGTCTGACTCTTTTGTTTAAAAAAGACGACAACTATTATGTAGGCATTATCCGCAAAGGTGCAAAAATCAATTTTGATGATACACAAGCAATCGCCGATAACACCGACAATTGCATCTTTAAAATGAATTATTTCCTACTTAAAGACGCAAAAAAATTTATCCCGAAATGTAGCATTCAGCTGAAAGAAGTCAAGGCCCATTTTAAGAAATCTGAAGATGATTACATTTTGTCTGATAAAGAGAAATTTGCTAGCCCGCTGGTCATTAAAAAGAGCACATTTTTGCTGGCAACTGCACATGTGAAAGGGAAAAAAGGCAATATCAAGAAATTTCAGAAAGA
ATATTCGAAAGAAAACCCCACTGAGTATCGCAATTCTTTAAACGAATGGATTGCTTTTTGTAAAGAGTTCTTAAAAACTTATAAAGCGGCTACCATTTTTGATATAACCACATTGAAAAAGGCAGAGGAATATGCTGATATTGTAGAATTCTACAAGGATGTCGATAATCTGTGCTACAAACTGGAGTTCTGCCCGATTAAAACCTCGTTTATAGAAAACCTGATAGATAACGGCGACCTGTATCTGTTTCGCATCAATAACAAAGACTTCAGCAGTAAATCGACCGGCACCAAGAACCTTCATACGTTATATTTACAAGCTATATTCGATGAACGTAATCTGAACAATCCGACAATTATGCTGAATGGGGGAGCAGAACTGTTCTATCGTAAAGAAAGTATTGAGCAGAAAAACCGTATCACACACAAAGCCGGTTCAATTCTCGTGAATAAGGTGTGTAAAGACGGTACAAGCCTGGATGATAAGATACGTAATGAAATTTATCAATATGAGAATAAATTTATTGATACCCTGTCTGATGAAGCTAAAAAGGTGTTACCGAATGTCATTAAAAAGGAAGCTACCCATGACATTACAAAAGATAAACGTTTCACTAGTGACAAATTCTTCTTTCACTGCCCCCTGACAATTAATTATAAGGAAGGCGATACCAAGCAGTTCAATAACGAAGTGCTGAGTTTTCTGCGTGGAAATCCTGACATCAACATTATCGGCATTGACCGCGGAGAGCGTAATTTAATCTATGTAACGGTTATAAACCAGAAAGGCGAGATTCTGGATTCGGTTTCATTCAATACCGTGACCAACAAGAGTTCAAAAATCGAGCAGACAGTCGATTATGAAGAGAAATTGGCAGTCCGCGAGAAAGAGAGGATTGAAGCAAAACGTTCCTGGGACTCTATCTCAAAAATTGCGACACTAAAGGAAGGTTATCTGAGCGCAATAGTTCACGAGATCTGTCTGTTAATGATTAAACACAACGCGATCGTTGTCTTAGAGAATCTTAATGCAGGCTTTAAGCGTATTCGTGGCGGTTTATCAGAAAAAAGTGTTTATCAAAAATTCGAAAAAATGTTGATTAACAAACTGAACTATTTTGTCAGCAAGAAGGAATCCGACTGGAATAAACCGTCTGGTCTGCTGAATGGACTGCAGCTTTCGGATCAGTTTGAAAGCTTCGAAAAACTGGGTATTCAGTCTGGTTTTATTTTTTACGTGCCGGCTGCATATACCTCAAAGATTGATCCGACCACGGGCTTCGCCAATGTTCTGAATCTGTCGAAGGTACGCAATGTTGATGCGATCAAAAGCTTTTTTTCTAACTTCAACGAAATTAGTTATAGCAAGAAAGAAGCCCTTTTCAAATTCTCATTCGATCTGGATTCACTGAGTAAGAAAGGCTTTAGTAGCTTTGTGAAATTTAGTAAGAGTAAATGGAACGTCTACACCTTTGGAGAACGTATCATAAAGCCAAAGAATAAGCAAGGTTATCGGGAGGACAAAAGAATCAACTTGACCTTCGAGATGAAGAAGTTACTTAACGAGTATAAGGTTTCTTTTGATCTTGAAAATAACTTGATTCCGAATCTCACGAGTGCCAACCTGAAGGATACTTTTTGGAAAGAGCTATTCTTTATCTTCAAGACTACGCTGCAGCTCCGTAACAGCGTTACTAACGGTAAAGAAGATGTGCTCATCTCTCCGGTCAAAAATGCGAAGGGTGAATTCTTCGTTTCGGGAACGCATAACAAGACTCTTCCGCAAGATTGCGATGCGAACGGTGCATACCATATTGCGTTGAAAGGTCTGATGATACTCGAACGTAACAACCTTGTACGTGAGGAGAAAGATACGAAAAAGATTATGGCGATTTCAAACGTGGATTGGTTCGAGTACGTGCAGAAACGTAGAGGCGTTCTGTAA 
SEQATGAACAACTACGACGAATTCACCAAACTGTACCCGATCCAGAAAACCATCCGTTTCGAACTGAAACCG IDCAGGGTCGTACCATGGAACACCTGGAAACCTTCAACTTCTTCGAAGAAGACCGTGACCGTGCGGAAAAANO: TACAAAATCCTGAAAGAAGCGATCGACGAATACCACAAAAAATTCATCGACGAACACCTGACCAACAT43 GTCTCTGGACTGGAACTCTCTGAAACAGATCTCTGAAAAATACTACAAATCTCGTGAAGAAAAAGACAAAAAAGTTTTCCTGTCTGAACAGAAACGTATGCGTCAGGAAATCGTTTCTGAATTCAAAAAAGACGACCGTTTCAAAGACCTGTTCTCTAAAAAACTGTTCTCTGAACTGCTGAAAGAAGAAATCTACAAAAAAGGTAACCACCAGGAAATCGACGCGCTGAAATCTTTCGACAAATTCTCTGGTTACTTCATCGGTCTGCACGAAAACCGTAAAAACATGTACTCTGACGGTGACGAAATCACCGCGATCTCTAACCGTATCGTTAACGAAAACTTCCCGAAATTCCTGGACAACCTGCAGAAATACCAGGAAGCGCGTAAAAAATACCCGGAATGGATCATCAAAGCGGAATCTGCGCTGGTTGCGCACAACATCAAAATGGACGAAGTTTTCTCTCTGGAATACTTCAACAAAGTTCTGAACCAGGAAGGTATCCAGCGTTACAACCTGGCGCTGGGTGGTTACGTTACCAAATCTGGTGAAAAAATGATGGGTCTGAACGACGCGCTGAACCTGGCGCACCAGTCTGAAAAATCTTCTAAAGGTCGTATCCACATGACCCCGCTGTTCAAACAGATCCTGTCTGAAAAAGAATCTTTCTCTTACATCCCGGACGTTTTCACCGAAGACTCTCAGCTGCTGCCGTCTATCGGTGGTTTCTTCGCGCAGATCGAAAACGACAAAGACGGTAACATCTTCGACCGTGCGCTGGAACTGATCTCTTCTTACGCGGAATACGACACCGAACGTATCTACATCCGTCAGGCGGACATCAACCGTGTTTCTAACGTTATCTTCGGTGAATGGGGTACCCTGGGTGGTCTGATGCGTGAATACAAAGCGGACTCTATCAACGACATCAACCTGGAACGTACCTGCAAAAAAGTTGACAAATGGCTGGACTCTAAAGAATTCGCGCTGTCTGACGTTCTGGAAGCGATCAAACGTACCGGTAACAACGACGCGTTCAACGAATACATCTCTAAAATGCGTACCGCGCGTGAAAAAATCGACGCGGCGCGTAAAGAAATGAAATTCATCTCTGAAAAAATCTCTGGTGACGAAGAATCTATCCACATCATCAAAACCCTGCTGGACTCTGTTCAGCAGTTCCTGCACTTCTTCAACCTGTTCAAAGCGCGTCAGGACATCCCGCTGGACGGTGCGTTCTACGCGGAATTCGACGAAGTTCACTCTAAACTGTTCGCGATCGTTCCGCTGTACAACAAAGTTCGTAACTACCTGACCAAAAACAACCTGAACACCAAAAAAATCAAACTGAACTTCAAAAACCCGACCCTGGCGAACGGTTGGGACCAGAACAAAGTTTACGACTACGCGTCTCTGATCTTCCTGCGTGACGGTAACTACTACCTGGGTATCATCAACCCGAAACGTAAAAAAAACATCAAATTCGAACAGGGTTCTGGTAACGGTCCGTTCTACCGTAAAATGGTTTACAAACAGATCCCGGGTCCGAACAAAAACCTGCCGCGTGTTTTCCTGACCTCTACCAAAGGTAAAAAAGAATACAAACCGTCTAAAGAAATCATCGAAGGTTACGAAGCGGACAAACACATCCGTGGTGACAAATTCGACCTGGACTTCTGCCACAAACTGATCGACTTCTTCAAAGAATCTATCGAAAAACACAAAGACTGGTCTAAATTCAACTTCTACTTCTCTCCGACCGAATCTTACGGTGACATCT
CTGAATTCTACCTGGACGTTGAAAAACAGGGTTACCGTATGCACTTCGAAAACATCTCTGCGGAAACCATCGACGAATACGTTGAAAAAGGTGACCTGTTCCTGTTCCAGATCTACAACAAAGACTTCGTTAAAGCGGCGACCGGTAAAAAAGACATGCACACCATCTACTGGAACGCGGCGTTCTCTCCGGAAAACCTGCAGGACGTTGTTGTTAAACTGAACGGTGAAGCGGAACTGTTCTACCGTGACAAATCTGACATCAAAGAAATCGTTCACCGTGAAGGTGAAATCCTGGTTAACCGTACCTACAACGGTCGTACCCCGGTTCCGGACAAAATCCACAAAAAACTGACCGACTACCACAACGGTCGTACCAAAGACCTGGGTGAAGCGAAAGAATACCTGGACAAAGTTCGTTACTTCAAAGCGCACTACGACATCACCAAAGACCGTCGTTACCTGAACGACAAAATCTACTTCCACGTTCCGCTGACCCTGAACTTCAAAGCGAACGGTAAAAAAAACCTGAACAAAATGGTTATCGAAAAATTCCTGTCTGACGAAAAAGCGCACATCATCGGTATCGACCGTGGTGAACGTAACCTGCTGTACTACTCTATCATCGACCGTTCTGGTAAAATCATCGACCAGCAGTCTCTGAACGTTATCGACGGTTTCGACTACCGTGAAAAACTGAACCAGCGTGAAATCGAAATGAAAGACGCGCGTCAGTCTTGGAACGCGATCGGTAAAATCAAAGACCTGAAAGAAGGTTACCTGTCTAAAGCGGTTCACGAAATCACCAAAATGGCGATCCAGTACAACGCGATCGTTGTTATGGAAGAACTGAACTACGGTTTCAAACGTGGTCGTTTCAAAGTTGAAAAACAGATCTACCAGAAATTCGAAAACATGCTGATCGACAAAATGAACTACCTGGTTTTCAAAGACGCGCCGGACGAATCTCCGGGTGGTGTTCTGAACGCGTACCAGCTGACCAACCCGCTGGAATCTTTCGCGAAACTGGGTAAACAGACCGGTATCCTGTTCTACGTTCCGGCGGCGTACACCTCTAAAATCGACCCGACCACCGGTTTCGTTAACCTGTTCAACACCTCTTCTAAAACCAACGCGCAGGAACGTAAAGAATTCCTGCAGAAATTCGAATCTATCTCTTACTCTGCGAAAGACGGTGGTATCTTCGCGTTCGCGTTCGACTACCGTAAATTCGGTACCTCTAAAACCGACCACAAAAACGTTTGGACCGCGTACACCAACGGTGAACGTATGCGTTACATCAAAGAAAAAAAACGTAACGAACTGTTCGACCCGTCTAAAGAAATCAAAGAAGCGCTGACCTCTTCTGGTATCAAATACGACGGTGGTCAGAACATCCTGCCGGACATCCTGCGTTCTAACAACAACGGTCTGATCTACACCATGTACTCTTCTTTCATCGCGGCGATCCAGATGCGTGTTTACGACGGTAAAGAAGACTACATCATCTCTCCGATCAAAAACTCTAAAGGTGAATTCTTCCGTACCGACCCGAAACGTCGTGAACTGCCGATCGACGCGGACGCGAACGGTGCGTACAACATCGCGCTGCGTGGTGAACTGACCATGCGTGCGATCGCGGAAAAATTCGACCCGGACTCTGAAAAAATGGCGAAACTGGAACTGAAACACAAAGACTGGTTCGAATTCATGCAGACCCGTGGTGACTAA SEQATGACTAAAACATTTGATTCAGAGTTTTTTAATTTGTACTCGCTGCAAAAAACGGTACGCTTTGAGTTAAID AACCCGTGGGAGAAACCGCGTCATTTGTGGAAGACTTTAAAAACGAGGGCTTGAAACGTGTTGTGAGCGNO:AAGATGAAAGGCGAGCCGTCGATTACCAGAAAGTTAAGGAAATAATTGACGATTACCATCGGGATTTCA 
44TTGAAGAAAGTTTAAATTATTTTCCGGAACAGGTGAGTAAAGATGCTCTTGAGCAGGCGTTTCATCTTTATCAGAAACTGAAGGCAGCAAAAGTTGAGGAAAGGGAAAAAGCGCTGAAAGAATGGGAAGCGCTGCAGAAAAAGCTACGTGAAAAAGTGGTGAAATGCTTCTCGGACTCGAATAAAGCCCGCTTCTCAAGGATTGATAAAAAGGAACTGATTAAGGAAGACCTGATAAATTGGTTGGTCGCCCAGAATCGCGAGGATGATATCCCTACGGTCGAAACGTTTAACAACTTCACCACATATTTTACCGGCTTCCATGAGAATCGTAAAAATATTTACTCCAAAGATGATCACGCCACCGCTATTAGCTTTCGCCTTATTCATGAAAATCTTCCAAAGTTTTTTGACAACGTGATTAGCTTCAATAAGTTGAAAGAGGGTTTCCCTGAATTAAAATTTGATAAAGTGAAAGAGGATTTAGAAGTAGATTATGATCTGAAGCATGCGTTTGAAATAGAATATTTCGTTAACTTCGTGACCCAAGCGGGCATAGATCAGTATAATTATCTGTTAGGAGGGAAAACCCTGGAGGACGGGACGAAAAAACAAGGGATGAATGAGCAAATTAATCTGTTCAAACAACAGCAAACGCGAGATAAAGCGCGTCAGATTCCCAAACTGATCCCCCTGTTCAAACAGATTCTTAGCGAAAGGACTGAAAGCCAGTCCTTTATTCCTAAACAATTTGAAAGTGATCAGGAGTTGTTCGATTCACTGCAGAAGTTACATAATAACTGCCAGGATAAATTCACCGTGCTGCAACAAGCCATTCTCGGTCTGGCAGAGGCGGATCTTAAGAAGGTCTTCATCAAAACCTCTGATTTAAATGCCTTATCTAACACCATTTTCGGGAATTACAGCGTCTTTTCCGATGCACTGAACCTGTATAAAGAAAGCCTGAAAACGAAAAAAGCGCAGGAGGCTTTTGAGAAACTACCGGCCCATTCTATTCACGACCTCATTCAATACTTGGAACAGTTCAATTCCAGCCTGGACGCGGAAAAACAACAGAGCACCGACACCGTCCTGAACTACTTCATCAAGACCGATGAATTATATTCTCGCTTCATTAAATCCACTAGCGAGGCTTTCACTCAGGTGCAGCCTTTGTTCGAACTGGAAGCCCTGTCATCTAAGCGCCGCCCACCGGAATCGGAAGATGAAGGGGCAAAAGGGCAGGAAGGCTTCGAGCAGATCAAGCGTATTAAAGCTTACCTGGATACGCTTATGGAAGCGGTACACTTTGCAAAGCCGTTGTATCTTGTTAAGGGTCGTAAAATGATCGAAGGGCTCGATAAAGACCAGTCCTTTTATGAAGCGTTTGAAATGGCGTACCAAGAACTTGAATCGTTAATCATTCCTATCTATAACAAAGCGCGGAGCTATCTGTCGCGGAAACCTTTCAAGGCCGATAAATTCAAGATTAATTTTGACAACAACACGCTACTGAGCGGATGGGATGCGAACAAGGAAACTGCTAACGCGTCCATTCTGTTTAAGAAAGACGGGTTATATTACCTTGGAATTATGCCGAAAGGTAAGACCTTTCTCTTTGACTACTTTGTATCGAGCGAGGATTCAGAGAAACTGAAACAGCGTCGCCAGAAGACCGCCGAAGAAGCTCTGGCGCAGGATGGTGAAAGTTACTTCGAAAAAATTCGTTATAAACTGTTACCAGGGGCTTCAAAGATGTTACCGAAAGTCTTTTTTAGCAACAAAAATATTGGCTTTTACAACCCGTCGGATGACATTTTACGCATTCGCAACACAGCCTCTCACACCAAAAACGGGACCCCTCAGAAAGGCCACTCAAAAGTTGAGTTTAACCTGAATGATTGTCATAAGATGATTGATTTCTTCAAATCATCAATTCAGAAACACCCGGAATGGGGGTCTTTTGGCTTTACGTTTTCTGATACCAGTGATTTTGAAG
ACATGAGTGCCTTCTACCGGGAAGTAGAAAACCAGGGTTACGTAATTAGCTTTGACAAAATCAAAGAGACCTATATACAGAGCCAGGTGGAACAGGGTAATCTCTACTTATTCCAGATTTATAACAAGGATTTCTCGCCCTACAGCAAAGGCAAACCAAACCTGCATACTCTGTACTGGAAAGCCCTGTTTGAAGAAGCGAACCTGAATAACGTAGTGGCGAAGTTGAACGGTGAAGCGGAAATCTTCTTCCGTCGTCACTCCATTAAGGCCTCTGATAAAGTTGTCCATCCGGCAAATCAGGCCATTGATAATAAGAATCCACACACGGAAAAAACGCAGTCAACCTTTGAATATGACCTCGTTAAAGACAAACGCTACACGCAAGATAAGTTCTTTTTCCACGTCCCAATCAGCCTCAACTTTAAAGCACAAGGGGTTTCAAAGTTTAATGATAAAGTCAATGGGTTCCTCAAGGGCAACCCGGATGTCAACATTATAGGTATAGACAGGGGCGAACGCCATCTGCTTTACTTTACCGTAGTGAATCAGAAAGGTGAAATACTGGTTCAGGAATCATTAAATACCTTGATGTCGGACAAAGGGCACGTTAATGATTACCAGCAGAAACTGGATAAAAAAGAACAGGAACGTGATGCTGCGCGTAAATCGTGGACCACGGTTGAGAACATTAAAGAGCTGAAAGAGGGGTATCTAAGCCATGTGGTACACAAACTGGCGCACCTCATCATTAAATATAACGCAATAGTCTGCCTAGAAGACTTGAATTTTGGCTTTAAACGCGGCCGCTTCAAAGTGGAAAAACAAGTTTATCAAAAATTTGAAAAGGCGCTTATAGATAAACTGAATTATCTGGTTTTTAAAGAAAAGGAACTTGGTGAGGTAGGGCACTACTTGACAGCTTATCAACTGACGGCCCCGTTCGAATCATTCAAAAAACTGGGCAAACAGTCTGGCATTCTGTTTTACGTGCCGGCAGATTATACTTCAAAAATCGATCCAACAACTGGCTTTGTGAACTTCCTGGACCTGAGATATCAGTCTGTAGAAAAAGCTAAACAACTTCTTAGCGATTTTAATGCCATTCGTTTTAACAGCGTTCAGAATTACTTTGAATTCGAAATTGACTATAAAAAACTTACTCCGAAACGTAAAGTCGGAACCCAAAGTAAATGGGTAATTTGTACGTATGGCGATGTCAGGTATCAGAACCGTCGGAATCAAAAAGGTCATTGGGAGACCGAAGAAGTGAACGTGACCGAAAAGCTGAAGGCTCTGTTCGCCAGCGATTCAAAAACTACAACTGTGATCGATTACGCAAATGATGATAACCTGATAGATGTGATTTTAGAGCAGGATAAAGCCAGCTTTTTTAAAGAACTGTTGTGGCTCCTGAAACTTACGATGACCTTACGACATTCCAAGATCAAATCGGAAGATGATTTTATTCTGTCACCGGTCAAGAATGAGCAGGGTGAATTCTATGATAGTAGGAAAGCCGGCGAAGTGTGGCCGAAAGACGCCGACGCCAATGGCGCCTATCATATCGCGCTCAAAGGGCTTTGGAATTTGCAGCAGATTAACCAGTGGGAAAAAGGTAAAACCCTGAATCTGGCTATCAAAAACCAGGATTGGTTTAGCTTTATCCAAGAGAAACCGTATCAGGAATGA SEQATGCATACAGGCGGTCTTCTTAGTATGGACGCGAAAGAGTTCACAGGTCAGTATCCGTTGTCGAAAACA IDTTACGATTCGAACTTCGGCCCATCGGCCGCACGTGGGATAACCTGGAGGCCTCAGGCTACTTAGCGGAANO:GACCGCCATCGTGCCGAATGTTATCCTCGTGCGAAAGAGTTATTGGATGACAACCATCGTGCCTTCCTGA45 
ATCGTGTGTTGCCACAAATCGATATGGATTGGCACCCGATTGCGGAGGCCTTTTGTAAGGTACATAAAAACCCTGGTAATAAAGAACTTGCCCAGGATTACAACCTTCAGTTGTCAAAGCGCCGTAAGGAGATCAGCGCATATCTTCAGGATGCAGATGGCTATAAAGGCCTGTTCGCGAAGCCCGCCTTAGACGAAGCTATGAAAATTGCGAAAGAAAACGGGAACGAAAGTGATATTGAGGTTCTCGAAGCGTTTAACGGTTTTAGCGTATACTTCACCGGTTATCATGAGTCACGCGAGAACATTTATAGCGATGAGGATATGGTGAGCGTAGCCTACCGAATTACTGAGGATAATTTCCCGCGCTTTGTCTCAAACGCTTTGATCTTTGATAAATTAAACGAAAGCCATCCGGATATTATCTCTGAAGTATCGGGCAATCTTGGAGTTGATGACATTGGTAAGTACTTTGACGTGTCGAACTATAACAATTTTCTTTCCCAGGCCGGTATAGATGACTACAATCACATTATTGGCGGCCATACAACCGAAGACGGACTGATACAAGCGTTTAATGTCGTATTGAACTTACGTCACCAAAAAGACCCTGGCTTTGAAAAAATTCAGTTCAAACAGCTCTACAAACAAATCCTGAGCGTGCGTACCAGCAAAAGCTACATCCCGAAACAGTTTGACAACTCTAAGGAGATGGTTGACTGCATTTGCGATTATGTCAGCAAAATAGAGAAATCCGAAACAGTAGAACGGGCCCTGAAACTAGTCCGTAATATCAGTTCTTTCGACTTGCGCGGGATCTTTGTCAATAAAAAGAACTTGCGCATACTGAGCAACAAACTGATAGGAGATTGGGACGCGATCGAAACCGCATTGATGCATAGTTCTTCATCAGAAAACGATAAGAAAAGCGTATATGATAGCGCGGAGGCTTTTACGTTGGATGACATCTTTTCAAGCGTGAAAAAATTTTCTGATGCCTCTGCCGAAGATATTGGCAACAGGGCGGAAGACATCTGTAGAGTGATAAGTGAGACGGCCCCTTTTATCAACGATCTGCGAGCGGTGGACCTGGATAGCCTGAACGACGATGGTTATGAAGCGGCCGTCTCAAAAATTCGGGAGTCGCTGGAGCCTTATATGGATCTTTTCCATGAACTGGAAATTTTCTCGGTTGGCGATGAGTTCCCAAAATGCGCAGCATTTTACAGCGAACTGGAGGAAGTCAGCGAACAGCTGATCGAAATTATTCCGTTATTCAACAAGGCGCGTTCGTTCTGCACCCGGAAACGCTATAGCACCGATAAGATTAAAGTGAACTTAAAATTCCCGACCTTGGCGGACGGGTGGGACCTGAACAAAGAGAGAGACAACAAAGCCGCGATTCTGCGGAAAGACGGTAAGTATTATCTGGCAATTCTGGATATGAAGAAAGATCTGTCAAGCATTAGGACCAGCGACGAAGATGAATCCAGCTTCGAAAAGATGGAGTATAAACTGTTACCGAGTCCAGTAAAAATGCTGCCAAAGATATTCGTAAAATCGAAAGCCGCTAAGGAAAAATATGGCCTGACAGATCGTATGCTTGAATGCTACGATAAAGGTATGCATAAGTCGGGTAGTGCGTTTGATCTTGGCTTTTGCCATGAACTCATTGATTATTACAAGCGTTGTATCGCGGAGTACCCAGGCTGGGATGTGTTCGATTTCAAGTTTCGCGAAACTTCCGATTATGGGTCCATGAAAGAGTTCAATGAAGATGTGGCCGGAGCCGGTTACTATATGAGTCTGAGAAAAATTCCGTGCAGCGAAGTGTACCGTCTGTTAGACGAGAAATCGATTTATCTATTTCAAATTTATAACAAAGATTACTCTGAAAATGCACATGGTAATAAGAACATGCATACCATGTACTGGGAGGGTCTCTTTTCCCCGCAAAACCTGGAGTCGCCCGTTTTCAAGTTGTCGGGTGGGGCAGAACTT
TTCTTTCGAAAATCCTCAATCCCTAACGATGCCAAAACAGTACACCCGAAAGGCTCAGTGCTGGTTCCACGTAATGATGTTAACGGTCGGCGTATTCCAGATTCAATCTACCGCGAACTGACACGCTATTTTAACCGTGGCGATTGCCGAATCAGTGACGAAGCCAAAAGTTATCTTGACAAGGTTAAGACTAAAAAAGCGGACCATGACATTGTGAAAGATCGCCGCTTTACCGTGGATAAAATGATGTTCCACGTCCCGATTGCGATGAACTTTAAGGCGATCAGTAAACCGAACTTAAACAAAAAAGTCATTGATGGCATCATTGATGATCAGGATCTGAAAATCATTGGTATTGATCGTGGCGAGCGGAACTTAATTTACGTCACGATGGTTGACAGAAAAGGGAATATCTTATATCAGGATTCTCTTAACATCCTCAATGGCTACGACTATCGTAAAGCTCTGGATGTGCGCGAATATGACAACAAGGAAGCGCGTCGTAACTGGACTAAAGTGGAGGGCATTCGCAAAATGAAGGAAGGCTATCTGTCATTAGCGGTCTCGAAATTAGCGGATATGATTATCGAAAATAACGCCATCATCGTTATGGAGGACCTGAACCACGGATTCAAAGCGGGCCGCTCAAAGATTGAAAAACAAGTTTATCAGAAATTTGAGAGTATGCTGATTAACAAACTGGGCTATATGGTGTTAAAAGACAAGTCAATTGACCAATCAGGTGGCGCGCTGCATGGATACCAGCTGGCGAACCATGTTACCACCTTAGCATCAGTTGGAAAGCAGTGTGGGGTTATCTTTTATATACCGGCAGCGTTCACTAGTAAAATAGATCCGACCACTGGTTTCGCCGATCTCTTTGCCCTGAGTAACGTTAAAAACGTAGCGAGCATGCGTGAATTCTTTTCCAAAATGAAATCTGTCATTTATGATAAAGCTGAAGGCAAATTCGCATTCACCTTTGATTACTTGGATTACAACGTGAAGAGCGAATGTGGTCGTACGCTGTGGACCGTTTACACCGTTGGTGAGCGCTTCACCTATTCCCGTGTGAACCGCGAATATGTACGTAAAGTCCCCACCGATATTATCTATGATGCCCTCCAGAAAGCAGGCATTAGCGTCGAAGGAGACTTAAGGGACAGAATTGCCGAAAGCGATGGCGATACGCTGAAGTCTATTTTTTACGCATTCAAATACGCGCTAGATATGCGCGTTGAGAATCGCGAGGAAGACTACATTCAATCACCTGTGAAAAATGCCTCTGGGGAATTTTTTTGTTCAAAAAATGCTGGTAAAAGCCTCCCACAAGATAGCGATGCAAACGGTGCATATAACATTGCCCTGAAAGGTATTCTTCAATTACGCATGCTGTCTGAGCAGTACGACCCCAACGCGGAATCTATTAGACTTCCGCTGATAACCAATAAAGCCTGGCTGACATTCATGCAGTCTGGCATGAAGACCTGGAAAAATTAG SEQATGGATAGTTTAAAAGATTTTACGAATCTATATCCCGTAAGCAAAACTCTTCGTTTTGAACTGAAACCTGID TTGGAAAAACGTTGGAGAATATCGAGAAAGCGGGCATCCTGAAAGAAGACGAGCACCGTGCCGAAAGCNO:TACAGGCGTGTCAAAAAGATTATCGATACTTATCACAAAGTGTTCATTGATAGCAGTCTGGAGAACATG 
46GCAAAAATGGGCATAGAAAATGAAATCAAAGCAATGCTGCAGAGCTTTTGCGAGCTCTACAAGAAAGATCACCGAACGGAAGGTGAAGATAAAGCACTGGACAAAATTCGCGCCGTTCTTCGCGGTCTGATTGTTGGCGCGTTCACCGGCGTGTGCGGCCGCCGTGAAAACACCGTGCAGAACGAAAAGTACGAGTCGCTGTTCAAAGAAAAACTGATAAAAGAAATTTTGCCTGACTTTGTGCTTTCGACCGAAGCGGAATCCCTGCCATTTTCTGTCGAAGAAGCGACCCGCAGCCTGAAAGAATTTGACTCATTCACAAGTTACTTTGCAGGCTTCTACGAAAACCGTAAAAACATCTACAGCACGAAGCCACAGAGCACGGCTATTGCTTATCGCCTGATTCATGAGAACCTGCCGAAGTTCATCGATAACATCCTTGTTTTTCAAAAAATTAAAGAGCCGATTGCGAAAGAGTTAGAACATATTCGAGCTGACTTTTCTGCGGGTGGGTACATTAAAAAAGATGAGCGGCTGGAAGACATCTTCAGTCTAAACTATTATATCCACGTTCTGTCGCAGGCAGGCATTGAGAAATATAATGCGCTGATTGGTAAGATTGTCACAGAAGGCGATGGTGAGATGAAAGGTCTTAATGAACATATCAATCTGTATAACCAGCAGCGTGGTCGCGAAGACCGTCTTCCACTGTTCCGCCCACTGTATAAACAGATCCTGTCTGACCGGGAACAGCTGTCCTACCTGCCGGAAAGCTTTGAAAAGGATGAAGAGCTACTTCGCGCATTAAAGGAGTTTTACGACCATATTGCGGAAGACATTTTGGGTAGAACGCAGCAACTGATGACGTCAATTTCTGAATACGATCTGAGTAGAATCTACGTTAGGAATGATAGCCAGCTGACCGATATTAGCAAAAAAATGCTGGGCGACTGGAACGCTATCTATATGGCACGTGAACGTGCATATGATCATGAACAAGCACCGAAACGTATAACCGCGAAATATGAGCGTGATCGCATTAAGGCGCTAAAGGGAGAAGAAAGCATCTCACTCGCAAACCTGAACTCCTGTATCGCTTTCTTAGATAACGTGCGCGATTGTCGCGTCGACACGTATCTGTCAACCCTTGGGCAGAAAGAGGGTCCACATGGTCTGTCTAACCTGGTGGAAAATGTCTTTGCGAGTTACCATGAAGCGGAACAACTGCTGTCTTTTCCATACCCCGAAGAAAACAATCTAATACAGGATAAAGATAACGTGGTGTTAATCAAAAACCTGCTGGACAACATCAGCGATCTGCAACGTTTCCTGAAACCTTTGTGGGGTATGGGTGACGAGCCAGACAAAGACGAACGTTTTTATGGTGAGTATAATTATATACGTGGCGCCCTTGACCAAGTTATTCCGCTGTATAACAAAGTACGGAACTATCTGACCCGTAAGCCATATTCTACCCGTAAAGTGAAACTGAACTTTGGCAACTCGCAACTGCTGTCGGGTTGGGATCGTAACAAAGAAAAAGATAATAGTTGTGTTATCCTGCGTAAGGGACAAAATTTTTACCTCGCGATTATGAACAACAGACACAAGCGTTCATTTGAAAATAAGGTTCTGCCGGAGTATAAAGAGGGCGAACCGTACTTCGAGAAAATGGATTATAAGTTCTTACCAGACCCTAATAAGATGTTACCGAAAGTCTTTCTTTCGAAAAAAGGCATAGAAATCTATAAGCCGTCCCCGAAATTACTCGAACAGTATGGGCACGGGACCCACAAGAAAGGGGATACTTTTAGCATGGACGATCTGCACGAACTGATCGATTTTTTTAAACACTCCATCGAAGCCCATGAAGACTGGAAACAGTTTGGGTTCAAGTTCTCTGATACAGCCACATACGAGAATGTGTCTAGTTTTTATCGGGAAGTGGAGGATCAGGGCTACAAACTTAGTTTTCGTAAAGTTTCAGAGAGTTATGTT
TATAGTTTAATTGATCAGGGAAAACTTTACCTGTTCCAGATCTACAACAAAGATTTCTCGCCATGTAGTAAGGGTACCCCGAATCTGCATACACTCTATTGGAGAATGTTATTCGATGAGCGTAACTTAGCGGATGTCATTTATAAATTGGACGGGAAAGCAGAGATCTTTTTTCGTGAAAAATCACTGAAGAATGACCACCCGACTCATCCGGCCGGGAAACCGATCAAAAAAAAATCCCGCCAGAAAAAAGGAGAAGAGTCTCTGTTTGAATATGATCTGGTGAAAGACCGTCATTACACTATGGATAAATTTCAATTTCATGTTCCAATTACAATGAACTTCAAATGTTCGGCGGGTTCCAAAGTAAATGATATGGTAAACGCCCATATTCGCGAAGCGAAAGATATGCATGTTATTGGCATCGATAGAGGCGAAAGAAACCTGCTTTATATTTGCGTAATTGACAGCCGTGGTACCATTCTGGACCAGATCTCTTTAAACACCATCAATGACATCGATTATCACGACCTGTTGGAGTCTCGGGACAAGGACCGCCAGCAGGAGCGCCGTAATTGGCAGACAATTGAAGGCATAAAAGAATTAAAACAGGGTTACCTTTCCCAGGCCGTACACCGCATAGCGGAACTGATGGTGGCCTACAAAGCCGTAGTTGCCCTGGAAGACTTGAATATGGGGTTTAAACGTGGCCGTCAAAAAGTCGAGAGCAGCGTGTATCAGCAATTTGAAAAACAGTTGATTGACAAGTTGAATTATTTGGTTGATAAAAAGAAACGTCCAGAAGATATTGGTGGCTTACTGCGTGCATACCAGTTTACGGCACCTTTTAAGTCCTTCAAAGAAATGGGTAAACAGAACGGGTTTCTGTTTTACATCCCGGCCTGGAATACATCCAACATCGATCCTACCACCGGGTTTGTCAACCTGTTTCATGCACAATATGAAAACGTGGATAAAGCGAAGAGTTTTTTCCAAAAATTCGATAGTATTTCGTATAACCCAAAAAAAGATTGGTTTGAGTTTGCGTTCGATTATAAAAATTTTACTAAAAAGGCTGAGGGATCCCGCAGTATGTGGATCCTCTGCACCCATGGCAGTCGTATTAAAAATTTTCGTAATTCGCAAAAGAATGGCCAGTGGGACTCGGAAGAGTTTGCCCTGACCGAAGCGTTCAAATCGCTGTTTGTACGCTACGAAATTGACTACACAGCAGATCTGAAAACAGCCATCGTCGATGAAAAACAGAAAGATTTTTTTGTAGATCTCCTAAAACTGTTCAAACTGACTGTTCAGATGCGCAATTCCTGGAAAGAGAAAGACCTGGATTATCTGATTAGCCCGGTAGCCGGTGCTGATGGACGATTTTTCGATACTCGTGAAGGTAACAAAAGTCTCCCGAAAGATGCTGATGCCAATGGTGCATACAATATTGCATTAAAGGGGCTATGGGCCTTGCGACAGATCCGCCAGACCAGCGAAGGCGGCAAGCTGAAATTGGCCATATCGAATAAGGAATGGTTACAATTTGTTCAGGAACGTAGCTATGAAAAAGATTGA SEQATGAACAACGGCACAAATAATTTTCAGAACTTCATCGGGATCTCAAGTTTGCAGAAAACGCTGCGCAAT IDGCTCTGATCCCCACGGAAACCACGCAACAGTTCATCGTCAAGAACGGAATAATTAAAGAAGATGAGTTANO:CGTGGCGAGAACCGCCAGATTCTGAAAGATATCATGGATGACTACTACCGCGGATTCATCTCTGAGACT 
47CTGAGTTCTATTGATGACATAGATTGGACTAGCCTGTTCGAAAAAATGGAAATTCAGCTGAAAAATGGTGATAATAAAGATACCTTAATTAAGGAACAGACAGAGTATCGGAAAGCAATCCATAAAAAATTTGCGAACGACGATCGGTTTAAGAACATGTTTAGCGCCAAACTGATTAGTGACATATTACCTGAATTTGTCATCCACAACAATAATTATTCGGCATCAGAGAAAGAGGAAAAAACCCAGGTGATAAAATTGTTTTCGCGCTTTGCGACTAGCTTTAAAGATTACTTCAAGAACCGTGCAAATTGCTTTTCAGCGGACGATATTTCATCAAGCAGCTGCCATCGCATCGTCAACGACAATGCAGAGATATTCTTTTCAAATGCGCTGGTCTACCGCCGGATCGTAAAATCGCTGAGCAATGACGATATCAACAAAATTTCGGGCGATATGAAAGATTCATTAAAAGAAATGAGTCTGGAAGAAATATATTCTTACGAGAAGTATGGGGAATTTATTACCCAGGAAGGCATTAGCTTCTATAATGATATCTGTGGGAAAGTGAATTCTTTTATGAACCTGTATTGTCAGAAAAATAAAGAAAACAAAAATTTATACAAACTTCAGAAACTTCACAAACAGATTCTATGCATTGCGGACACTAGCTATGAGGTCCCGTATAAATTTGAAAGTGACGAGGAAGTGTACCAATCAGTTAACGGCTTCCTTGATAACATTAGCAGCAAACATATAGTCGAAAGATTACGCAAAATCGGCGATAACTATAACGGCTACAACCTGGATAAAATTTATATCGTGTCCAAATTTTACGAGAGCGTTAGCCAAAAAACCTACCGCGACTGGGAAACAATTAATACCGCCCTCGAAATTCATTACAATAATATCTTGCCGGGTAACGGTAAAAGTAAAGCCGACAAAGTAAAAAAAGCGGTTAAGAATGATTTACAGAAATCCATCACCGAAATAAATGAACTAGTGTCAAACTATAAGCTGTGCAGTGACGACAACATCAAAGCGGAGACTTATATACATGAGATTAGCCATATCTTGAATAACTTTGAAGCACAGGAATTGAAATACAATCCGGAAATTCACCTAGTTGAATCCGAGCTCAAAGCGAGTGAGCTTAAAAACGTGCTGGACGTGATCATGAATGCGTTTCATTGGTGTTCGGTTTTTATGACTGAGGAACTTGTTGATAAAGACAACAATTTTTATGCGGAACTGGAGGAGATTTACGATGAAATTTATCCAGTAATTAGTCTGTACAACCTGGTTCGTAACTACGTTACCCAGAAACCGTACAGCACGAAAAAGATTAAATTGAACTTTGGAATACCGACGTTAGCAGACGGTTGGTCAAAGTCCAAAGAGTATTCTAATAACGCTATCATACTGATGCGCGACAATCTGTATTATCTGGGCATCTTTAATGCGAAGAATAAACCGGACAAGAAGATTATCGAGGGTAATACGTCAGAAAATAAGGGTGACTACAAAAAGATGATTTATAATTTGCTCCCGGGTCCCAACAAAATGATCCCGAAAGTTTTCTTGAGCAGCAAGACGGGGGTGGAAACGTATAAACCGAGCGCCTATATCCTAGAGGGGTATAAACAGAATAAACATATCAAGTCTTCAAAAGACTTTGATATCACTTTCTGTCATGATCTGATCGACTACTTCAAAAACTGTATTGCAATTCATCCCGAGTGGAAAAACTTCGGTTTTGATTTTAGCGACACCAGTACTTATGAAGACATTTCCGGGTTTTATCGTGAGGTAGAGTTACAAGGTTACAAGATTGATTGGACATACATTAGCGAAAAAGACATTGATCTGCTGCAGGAAAAAGGTCAACTGTATCTGTTCCAGATATATAACAAAGATTTTTCGAAAAAATCAACCGGGAATGACAACCTTCACACCATGTACCTGAAAAATCTTTTCTCAGAAGAAAATCTTAAGGAT
ATCGTCCTGAAACTTAACGGCGAAGCGGAAATCTTCTTCAGGAAGAGCAGCATAAAGAACCCAATCATTCATAAAAAAGGCTCGATTTTAGTCAACCGTACCTACGAAGCAGAAGAAAAAGACCAGTTTGGCAACATTCAAATTGTGCGTAAAAATATTCCGGAAAACATTTATCAGGAGCTGTACAAATACTTCAACGATAAAAGCGACAAAGAGCTGTCTGATGAAGCAGCCAAACTGAAGAATGTAGTGGGACACCACGAGGCAGCGACGAATATAGTCAAGGACTATCGCTACACGTATGATAAATACTTCCTTCATATGCCTATTACGATCAATTTCAAAGCCAATAAAACGGGTTTTATTAATGATAGGATCTTACAGTATATCGCTAAAGAAAAAGACTTACATGTGATCGGCATTGATCGGGGCGAGCGTAACCTGATCTACGTGTCCGTGATTGATACTTGTGGTAATATAGTTGAACAGAAAAGCTTTAACATTGTAAACGGCTACGACTATCAGATAAAACTGAAACAACAGGAGGGCGCTAGACAGATTGCGCGGAAAGAATGGAAAGAAATTGGTAAAATTAAAGAGATCAAAGAGGGCTACCTGAGCTTAGTAATCCACGAGATCTCTAAAATGGTAATCAAATACAATGCAATTATAGCGATGGAGGATTTGTCTTATGGTTTTAAAAAAGGGCGCTTTAAGGTCGAACGGCAAGTTTACCAGAAATTTGAAACCATGCTCATCAATAAACTCAACTATCTGGTATTTAAAGATATTTCGATTACCGAGAATGGCGGTCTCCTGAAAGGTTATCAGCTGACATACATTCCTGATAAACTTAAAAACGTGGGTCATCAGTGCGGCTGCATTTTTTATGTGCCTGCTGCATACACGAGCAAAATTGATCCGACCACCGGCTTTGTGAATATCTTTAAATTTAAAGACCTGACAGTGGACGCAAAACGTGAATTCATTAAAAAATTTGACTCAATTCGTTATGACAGTGAAAAAAATCTGTTCTGCTTTACATTTGACTACAATAACTTTATTACGCAAAACACGGTCATGAGCAAATCATCGTGGAGTGTGTATACATACGGCGTGCGCATCAAACGTCGCTTTGTGAACGGCCGCTTCTCAAACGAAAGTGATACCATTGACATAACCAAAGATATGGAGAAAACGTTGGAAATGACGGACATTAACTGGCGCGATGGCCACGATCTTCGTCAAGACATTATAGATTATGAAATTGTTCAGCACATATTCGAAATTTTCCGTTTAACAGTGCAAATGCGTAACTCCTTGTCTGAACTGGAGGACCGTGATTACGATCGTCTCATTTCACCTGTACTGAACGAAAATAACATTTTTTATGACAGCGCGAAAGCGGGGGATGCACTTCCTAAGGATGCCGATGCAAATGGTGCGTATTGTATTGCATTAAAAGGGTTATATGAAATTAAACAAATTACCGAAAATTGGAAAGAAGATGGTAAATTTTCGCGCGATAAACTCAAAATCAGCAATAAAGATTGGTTCGACTTTATCCAGAATAAGCGCTATCTCTAA SEQATGACCAATAAATTCACTAACCAGTATTCTCTCTCTAAGACCCTGCGCTTTGAACTGATTCCGCAGGGGAID AAACCTTGGAGTTCATTCAAGAAAAAGGCCTCTTGTCTCAGGATAAACAGAGGGCTGAATCTTACCAAGNO:AAATGAAGAAAACTATTGATAAGTTTCATAAATATTTCATTGATTTAGCCTTGTCTAACGCCAAATTAAC48 
TCACTTGGAAACGTATCTGGAGTTATACAACAAATCTGCCGAAACTAAGAAAGAACAGAAATTTAAAGACGATTTGAAAAAAGTACAGGACAATCTGCGTAAAGAAATTGTCAAATCCTTCAGTGACGGCGATGCTAAAAGCATTTTTGCCATTCTGGACAAAAAAGAGTTGATTACTGTGGAATTAGAAAAGTGGTTTGAAAACAATGAGCAGAAAGACATCTACTTCGATGAGAAATTCAAAACTTTCACCACCTATTTTACAGGATTTCATCAAAACCGGAAGAACATGTACTCAGTAGAACCGAACTCCACGGCCATTGCGTATCGTTTGATCCATGAGAATCTGCCTAAATTTCTGGAGAATGCGAAAGCCTTTGAAAAGATTAAGCAGGTCGAATCGCTGCAAGTGAATTTTCGTGAACTCATGGGCGAATTTGGTGACGAAGGTCTAATCTTCGTTAACGAACTGGAAGAAATGTTTCAGATTAATTACTACAATGACGTGCTATCGCAGAACGGTATCACAATCTACAATAGTATTATCTCAGGGTTCACAAAAAACGATATAAAATACAAAGGCCTGAACGAGTATATCAATAACTACAACCAAACAAAGGACAAAAAGGATAGGCTTCCGAAACTGAAGCAGTTATACAAACAGATTTTATCTGACAGAATCTCCCTGAGCTTTCTGCCGGATGCTTTCACTGATGGGAAGCAGGTTCTGAAAGCGATTTTCGATTTTTATAAGATTAACTTACTGAGCTACACGATTGAAGGTCAAGAAGAATCTCAAAACTTACTGCTCTTGATCCGTCAAACCATTGAAAATCTATCATCGTTCGATACGCAGAAAATCTACCTCAAAAACGATACTCACCTGACTACGATCTCTCAGCAGGTTTTCGGGGATTTTAGTGTATTTTCAACAGCTCTGAACTACTGGTATGAAACCAAAGTCAATCCGAAATTCGAGACGGAATATTCTAAGGCCAACGAAAAAAAACGTGAGATTCTTGATAAAGCTAAAGCCGTATTTACTAAACAGGATTACTTTTCTATTGCTTTCCTGCAGGAAGTTTTATCGGAGTATATCCTGACCCTGGATCATACATCTGATATCGTTAAAAAACACAGCAGCAATTGCATCGCTGACTATTTCAAAAACCACTTTGTCGCCAAAAAAGAAAACGAAACAGACAAGACTTTCGATTTCATTGCTAACATCACCGCAAAATACCAGTGTATTCAGGGTATCTTGGAAAACGCCGACCAATACGAAGACGAACTGAAACAAGATCAGAAGCTGATCGATAATTTAAAATTCTTCTTAGATGCAATCCTGGAGCTGCTGCACTTCATCAAACCGCTTCATTTAAAGAGCGAGTCCATTACCGAAAAGGACACCGCCTTCTATGACGTTTTTGAAAATTATTATGAAGCCCTCTCCTTGCTGACTCCGCTGTATAATATGGTACGCAATTACGTAACCCAGAAACCATATTCTACCGAAAAAATTAAACTGAACTTTGAAAACGCACAGCTGCTCAACGGTTGGGACGCGAATAAAGAAGGTGACTACCTCACCACCATCCTGAAAAAAGATGGTAACTATTTTCTGGCAATTATGGATAAGAAACATAATAAAGCATTCCAGAAATTTCCTGAAGGGAAAGAAAATTACGAAAAGATGGTGTACAAACTCTTACCTGGAGTTAACAAAATGTTGCCGAAAGTATTTTTTAGTAATAAGAACATCGCGTACTTTAACCCGTCCAAAGAACTGCTGGAAAATTATAAAAAGGAGACGCATAAGAAAGGGGATACCTTTAACCTGGAACATTGCCATACCTTAATAGACTTCTTCAAGGATTCCCTGAATAAACACGAGGATTGGAAATATTTCGATTTTCAGTTTAGTGAGACCAAGTCATACCAGGATCTTAGCGGCTTTTATCGCGAAGTAGAACACCAAGGCTATAAAATTAACTTCAAAA
ACATCGACAGCGAATACATCGACGGTTTAGTTAACGAGGGCAAACTGTTTCTGTTCCAGATCTATTCAAAGGATTTTAGCCCGTTCTCTAAAGGCAAACCAAATATGCATACGTTGTACTGGAAAGCACTGTTTGAAGAGCAAAACCTGCAGAATGTGATTTATAAACTGAACGGCCAAGCTGAGATTTTTTTCCGTAAAGCCTCGATTAAACCGAAAAATATCATCCTTCATAAGAAGAAAATAAAGATCGCTAAAAAACACTTCATAGATAAAAAAACCAAAACCTCCGAAATAGTGCCTGTTCAAACAATTAAGAACTTGAATATGTACTACCAGGGCAAGATATCGGAAAAGGAGTTGACTCAAGACGATCTTCGCTATATCGATAACTTTTCGATTTTTAACGAAAAAAACAAGACGATCGACATCATCAAAGATAAACGCTTCACTGTAGATAAGTTCCAGTTTCATGTGCCGATTACTATGAACTTCAAAGCTACCGGGGGTAGCTATATCAACCAAACGGTGTTGGAATACCTGCAGAATAACCCGGAAGTCAAAATCATTGGGCTGGACCGCGGAGAACGTCACCTTGTGTACTTGACCTTAATCGATCAGCAAGGCAACATCTTAAAACAAGAATCGCTGAATACCATTACGGATTCAAAGATTAGCACCCCGTATCATAAGCTGCTCGATAACAAGGAGAATGAGCGCGACCTGGCCCGTAAAAACTGGGGCACGGTGGAAAACATTAAGGAGTTAAAGGAGGGTTATATTTCCCAGGTAGTGCATAAGATCGCCACTCTCATGCTCGAGGAAAATGCGATCGTTGTCATGGAAGACTTAAACTTCGGATTTAAACGTGGGCGATTTAAAGTAGAGAAACAAATCTACCAGAAGTTAGAAAAAATGCTGATTGACAAATTAAATTACTTGGTCCTAAAAGACAAACAGCCGCAAGAATTGGGTGGATTATACAACGCCCTCCAACTTACCAATAAATTCGAAAGTTTTCAGAAAATGGGTAAACAGTCAGGCTTTCTTTTTTATGTTCCTGCGTGGAACACATCCAAAATCGACCCTACAACCGGCTTCGTCAATTACTTCTATACTAAATATGAAAACGTCGACAAAGCAAAAGCATTCTTTGAAAAGTTCGAAGCAATACGTTTTAACGCTGAGAAAAAATATTTCGAGTTCGAAGTCAAGAAATACTCAGACTTTAACCCCAAAGCTGAGGGCACACAGCAAGCGTGGACAATCTGCACCTACGGCGAGCGCATCGAAACGAAGCGTCAAAAAGATCAGAATAACAAATTTGTTTCAACACCTATCAACCTGACCGAGAAGATTGAAGACTTCTTAGGTAAAAATCAGATTGTTTATGGCGACGGTAACTGTATAAAATCTCAAATAGCCTCAAAGGATGATAAAGCATTTTTCGAAACATTATTATATTGGTTCAAAATGACACTGCAGATGCGCAATAGTGAGACGCGTACAGATATTGATTATCTTATCAGCCCGGTCATGAACGACAACGGTACTTTTTACAACTCCAGAGACTATGAAAAACTTGAGAATCCAACTCTCCCCAAAGATGCTGATGCGAACGGTGCTTATCACATCGCGAAAAAAGGTCTGATGCTGCTGAACAAAATCGACCAAGCCGATCTGACTAAGAAAGTTGACCTAAGCATTTCAAATCGGGACTGGTTACAGTTTGTTCAAAAGAACAAATGA SEQATGGAACAGGAATATTATCTGGGCTTGGACATGGGCACCGGTTCCGTCGGCTGGGCTGTTACTGACAGT IDGAATATCACGTTCTAAGAAAGCATGGTAAGGCATTGTGGGGTGTAAGACTTTTCGAATCTGCTTCCACTGNO:CTGAAGAGCGTAGAATGTTTAGAACGAGTCGACGTAGGCTAGACAGGCGCAATTGGAGAATCGAAATTT 
49TACAAGAAATTTTTGCGGAAGAGATATCTAAGAAAGACCCAGGCTTTTTCCTGAGAATGAAGGAATCTAAGTATTACCCTGAGGATAAAAGAGATATAAATGGTAACTGTCCCGAATTGCCTTACGCATTATTTGTGGACGATGATTTTACCGATAAGGATTACCATAAAAAGTTCCCAACTATCTACCATTTACGCAAAATGTTAATGAATACAGAGGAAACCCCAGACATAAGACTAGTTTATCTGGCAATACACCATATGATGAAACATAGAGGCCATTTCTTACTTTCCGGGGATATCAACGAAATCAAAGAGTTTGGTACCACATTTAGTAAGTTACTGGAAAACATAAAGAATGAAGAATTGGATTGGAACTTAGAACTCGGAAAAGAAGAATACGCGGTTGTCGAATCTATCCTGAAGGATAATATGCTGAATAGGTCGACCAAAAAAACTAGGCTGATCAAAGCACTGAAAGCCAAATCTATCTGCGAAAAAGCTGTTTTAAATTTACTTGCTGGTGGCACTGTTAAGTTATCAGACATTTTTGGTTTGGAAGAATTGAACGAAACCGAGCGTCCAAAAATTAGTTTCGCTGATAATGGCTACGATGATTACATTGGTGAGGTGGAAAACGAGTTGGGCGAACAATTTTATATTATAGAGACAGCTAAGGCAGTCTATGACTGGGCTGTTTTAGTAGAAATCCTTGGTAAATACACATCTATCTCCGAAGCGAAAGTTGCTACTTACGAAAAGCACAAGTCCGATCTCCAGTTTTTGAAGAAAATTGTCAGGAAATATCTGACTAAGGAAGAATATAAAGATATTTTCGTTAGTACCTCTGACAAACTGAAAAATTACTCCGCTTACATCGGGATGACCAAGATTAATGGCAAAAAAGTTGATCTGCAAAGCAAAAGGTGTTCGAAGGAAGAATTTTATGATTTCATTAAAAAGAATGTCTTAAAAAAATTAGAAGGTCAGCCAGAATACGAATATTTGAAAGAAGAACTGGAAAGAGAGACATTCTTACCAAAACAAGTCAACAGAGATAATGGGGTAATTCCATATCAAATTCACCTCTACGAATTAAAAAAAATTTTAGGCAATTTACGCGATAAAATTGACCTTATCAAAGAAAATGAGGATAAGCTGGTTCAACTCTTTGAATTCAGAATACCCTATTATGTGGGCCCACTGAACAAGATTGATGACGGCAAAGAAGGTAAATTCACATGGGCCGTCCGCAAATCCAATGAAAAAATTTACCCATGGAACTTTGAAAATGTAGTAGATATTGAAGCGTCTGCGGAGAAATTTATTCGAAGAATGACTAATAAATGCACTTACTTGATGGGAGAGGATGTTCTGCCTAAAGACAGCTTATTATACAGCAAGTACATGGTTCTAAACGAACTTAACAACGTTAAGTTGGACGGTGAGAAATTAAGTGTAGAATTGAAACAAAGATTGTATACTGACGTCTTCTGCAAGTACAGAAAAGTGACAGTTAAAAAAATTAAGAATTACTTGAAGTGCGAAGGTATAATTTCTGGAAACGTAGAGATTACTGGTATTGATGGTGATTTCAAAGCATCCCTAACAGCTTACCACGATTTCAAGGAAATCCTGACAGGAACTGAACTCGCAAAAAAAGATAAAGAAAACATTATTACTAATATTGTTCTTTTCGGTGATGACAAGAAATTGTTGAAGAAAAGACTGAATAGACTTTACCCCCAGATTACTCCCAATCAACTTAAGAAAATTTGTGCTTTGTCTTACACAGGATGGGGTCGTTTTTCAAAAAAGTTCTTAGAAGAGATTACCGCACCTGATCCAGAAACAGGCGAAGTATGGAATATAATTACCGCCTTATGGGAATCGAACAATAATCTTATGCAACTTCTGAGCAATGAATATCGTTTCATGGAAGAAGTTGAGACTTACAACATGGGCAAACAGACGAAGACTTTATCCTATGAAACTGTGG
AAAATATGTATGTATCACCTTCTGTCAAGAGACAAATTTGGCAAACCTTAAAAATTGTCAAAGAATTAGAAAAGGTAATGAAGGAGTCTCCTAAACGTGTGTTTATTGAAATGGCTAGAGAAAAACAAGAGTCAAAAAGAACCGAGTCAAGAAAGAAGCAGTTAATCGATTTATATAAGGCTTGTAAAAACGAAGAGAAAGATTGGGTTAAAGAATTGGGGGACCAAGAGGAACAAAAACTACGGTCGGATAAGTTGTATTTATACTATACGCAAAAGGGACGATGTATGTATTCCGGCGAGGTAATAGAATTGAAGGATTTATGGGACAATACAAAATATGACATAGACCATATATATCCCCAATCAAAAACGATGGACGATAGCTTGAACAATAGAGTACTCGTGAAAAAAAAATATAATGCGACCAAATCTGATAAGTATCCTCTGAATGAAAATATCAGACATGAAAGAAAGGGGTTCTGGAAGTCCTTGTTAGATGGTGGGTTTATAAGCAAAGAAAAGTACGAGCGTCTAATAAGAAACACGGAGTTATCGCCAGAAGAACTCGCTGGTTTTATTGAGAGGCAAATCGTGGAAACGAGACAATCTACCAAAGCCGTTGCTGAGATCCTAAAGCAAGTTTTCCCAGAGTCGGAGATTGTCTATGTCAAAGCTGGCACAGTGAGCAGGTTTAGGAAAGACTTCGAACTATTAAAGGTAAGAGAAGTGAACGATTTACATCACGCAAAGGACGCTTACCTAAATATCGTTGTAGGTAACTCATATTATGTTAAATTTACCAAGAACGCCTCTTGGTTTATAAAGGAGAACCCAGGTAGAACATATAACCTGAAAAAGATGTTCACCTCTGGTTGGAATATTGAGAGAAACGGAGAAGTCGCATGGGAAGTTGGTAAGAAAGGGACTATAGTGACAGTAAAGCAAATTATGAACAAAAATAATATCCTCGTTACAAGGCAGGTTCATGAAGCAAAGGGCGGCCTTTTTGACCAACAAATTATGAAGAAAGGGAAAGGTCAAATTGCAATAAAAGAAACCGATGAGAGACTAGCGTCAATAGAAAAGTATGGTGGCTATAATAAAGCTGCGGGTGCATACTTTATGCTTGTTGAATCAAAAGACAAGAAAGGTAAGACTATTAGAACTATAGAATTTATACCCCTGTACCTTAAAAACAAAATTGAATCGGATGAGTCAATCGCGTTAAATTTTCTAGAGAAAGGAAGGGGTTTAAAAGAACCAAAGATCCTGTTAAAAAAGATTAAGATTGACACCTTGTTCGATGTAGATGGATTTAAAATGTGGTTATCTGGCAGAACAGGCGATAGACTTTTGTTTAAGTGCGCTAATCAATTAATTTTGGATGAGAAAATCATTGTCACAATGAAAAAAATAGTTAAGTTTATTCAGAGAAGACAAGAAAACAGGGAGTTGAAATTATCTGATAAAGATGGTATCGACAATGAAGTTTTAATGGAAATCTACAATACATTCGTTGATAAACTTGAAAATACCGTATATCGAATCAGGTTAAGTGAACAAGCCAAAACATTAATTGATAAACAAAAAGAATTTGAAAGGCTATCACTGGAAGACAAATCCTCCACCCTATTTGAAATTTTGCATATATTCCAGTGCCAATCTTCAGCAGCTAATTTAAAAATGATTGGCGGACCTGGGAAAGCCGGCATCCTAGTGATGAACAATAATATCTCCAAGTGTAACAAAATATCAATTATTAACCAATCTCCGACAGGTATTTTTGAAAATGAAATAGACTTGCTTAAGATATAA SEQATGTCTTTCGACTCTTTCACCAACCTGTACTCTCTGTCTAAAACCCTGAAATTCGAAATGCGTCCGGTTGGID 
TAACACCCAGAAAATGCTGGACAACGCGGGTGTTTTCGAAAAAGACAAACTGATCCAGAAAAAATACGNO:GTAAAACCAAACCGTACTTCGACCGTCTGCACCGTGAATTCATCGAAGAAGCGCTGACCGGTGTTGAAC 50TGATCGGTCTGGACGAAAACTTCCGTACCCTGGTTGACTGGCAGAAAGACAAAAAAAACAACGTTGCGATGAAAGCGTACGAAAACTCTCTGCAGCGTCTGCGTACCGAAATCGGTAAAATCTTCAACCTGAAAGCGGAAGACTGGGTTAAAAACAAATACCCGATCCTGGGTCTGAAAAACAAAAACACCGACATCCTGTTCGAAGAAGCGGTTTTCGGTATCCTGAAAGCGCGTTACGGTGAAGAAAAAGACACCTTCATCGAAGTTGAAGAAATCGACAAAACCGGTAAATCTAAAATCAACCAGATCTCTATCTTCGACTCTTGGAAAGGTTTCACCGGTTACTTCAAAAAATTCTTCGAAACCCGTAAAAACTTCTACAAAAACGACGGTACCTCTACCGCGATCGCGACCCGTATCATCGACCAGAACCTGAAACGTTTCATCGACAACCTGTCTATCGTTGAATCTGTTCGTCAGAAAGTTGACCTGGCGGAAACCGAAAAATCTTTCTCTATCTCTCTGTCTCAGTTCTTCTCTATCGACTTCTACAACAAATGCCTGCTGCAGGACGGTATCGACTACTACAACAAAATCATCGGTGGTGAAACCCTGAAAAACGGTGAAAAACTGATCGGTCTGAACGAACTGATCAACCAGTACCGTCAGAACAACAAAGACCAGAAAATCCCGTTCTTCAAACTGCTGGACAAACAGATCCTGTCTGAAAAAATCCTGTTCCTGGACGAAATCAAAAACGACACCGAACTGATCGAAGCGCTGTCTCAGTTCGCGAAAACCGCGGAAGAAAAAACCAAAATCGTTAAAAAACTGTTCGCGGACTTCGTTGAAAACAACTCTAAATACGACCTGGCGCAGATCTACATCTCTCAGGAAGCGTTCAACACCATCTCTAACAAATGGACCTCTGAAACCGAAACCTTCGCGAAATACCTGTTCGAAGCGATGAAATCTGGTAAACTGGCGAAATACGAAAAAAAAGACAACTCTTACAAATTCCCGGACTTCATCGCGCTGTCTCAGATGAAATCTGCGCTGCTGTCTATCTCTCTGGAAGGTCACTTCTGGAAAGAAAAATACTACAAAATCTCTAAATTCCAGGAAAAAACCAACTGGGAACAGTTCCTGGCGATCTTCCTGTACGAATTCAACTCTCTGTTCTCTGACAAAATCAACACCAAAGACGGTGAAACCAAACAGGTTGGTTACTACCTGTTCGCGAAAGACCTGCACAACCTGATCCTGTCTGAACAGATCGACATCCCGAAAGACTCTAAAGTTACCATCAAAGACTTCGCGGACTCTGTTCTGACCATCTACCAGATGGCGAAATACTTCGCGGTTGAAAAAAAACGTGCGTGGCTGGCGGAATACGAACTGGACTCTTTCTACACCCAGCCGGACACCGGTTACCTGCAGTTCTACGACAACGCGTACGAAGACATCGTTCAGGTTTACAACAAACTGCGTAACTACCTGACCAAAAAACCGTACTCTGAAGAAAAATGGAAACTGAACTTCGAAAACTCTACCCTGGCGAACGGTTGGGACAAAAACAAAGAATCTGACAACTCTGCGGTTATCCTGCAGAAAGGTGGTAAATACTACCTGGGTCTGATCACCAAAGGTCACAACAAAATCTTCGACGACCGTTTCCAGGAAAAATTCATCGTTGGTATCGAAGGTGGTAAATACGAAAAAATCGTTTACAAATTCTTCCCGGACCAGGCGAAAATGTTCCCGAAAGTTTGCTTCTCTGCGAAAGGTCTGGAATTCTTCCGTCCGTCTGAAGAAATCCTGCGTATCTACAACAACGCGGAATTCAAAAAAGGTG
AAACCTACTCTATCGACTCTATGCAGAAACTGATCGACTTCTACAAAGACTGCCTGACCAAATACGAAGGTTGGGCGTGCTACACCTTCCGTCACCTGAAACCGACCGAAGAATACCAGAACAACATCGGTGAATTCTTCCGTGACGTTGCGGAAGACGGTTACCGTATCGACTTCCAGGGTATCTCTGACCAGTACATCCACGAAAAAAACGAAAAAGGTGAACTGCACCTGTTCGAAATCCACAACAAAGACTGGAACCTGGACAAAGCGCGTGACGGTAAATCTAAAACCACCCAGAAAAACCTGCACACCCTGTACTTCGAATCTCTGTTCTCTAACGACAACGTTGTTCAGAACTTCCCGATCAAACTGAACGGTCAGGCGGAAATCTTCTACCGTCCGAAAACCGAAAAAGACAAACTGGAATCTAAAAAAGACAAAAAAGGTAACAAAGTTATCGACCACAAACGTTACTCTGAAAACAAAATCTTCTTCCACGTTCCGCTGACCCTGAACCGTACCAAAAACGACTCTTACCGTTTCAACGCGCAGATCAACAACTTCCTGGCGAACAACAAAGACATCAACATCATCGGTGTTGACCGTGGTGAAAAACACCTGGTTTACTACTCTGTTATCACCCAGGCGTCTGACATCCTGGAATCTGGTTCTCTGAACGAACTGAACGGTGTTAACTACGCGGAAAAACTGGGTAAAAAAGCGGAAAACCGTGAACAGGCGCGTCGTGACTGGCAGGACGTTCAGGGTATCAAAGACCTGAAAAAAGGTTACATCTCTCAGGTTGTTCGTAAACTGGCGGACCTGGCGATCAAACACAACGCGATCATCATCCTGGAAGACCTGAACATGCGTTTCAAACAGGTTCGTGGTGGTATCGAAAAATCTATCTACCAGCAGCTGGAAAAAGCGCTGATCGACAAACTGTCTTTCCTGGTTGACAAAGGTGAAAAAAACCCGGAACAGGCGGGTCACCTGCTGAAAGCGTACCAGCTGTCTGCGCCGTTCGAAACCTTCCAGAAAATGGGTAAACAGACCGGTATCATCTTCTACACCCAGGCGTCTTACACCTCTAAATCTGACCCGGTTACCGGTTGGCGTCCGCACCTGTACCTGAAATACTTCTCTGCGAAAAAAGCGAAAGACGACATCGCGAAATTCACCAAAATCGAATTCGTTAACGACCGTTTCGAACTGACCTACGACATCAAAGACTTCCAGCAGGCGAAAGAATACCCGAACAAAACCGTTTGGAAAGTTTGCTCTAACGTTGAACGTTTCCGTTGGGACAAAAACCTGAACCAGAACAAAGGTGGTTACACCCACTACACCAACATCACCGAAAACATCCAGGAACTGTTCACCAAATACGGTATCGACATCACCAAAGACCTGCTGACCCAGATCTCTACCATCGACGAAAAACAGAACACCTCTTTCTTCCGTGACTTCATCTTCTACTTCAACCTGATCTGCCAGATCCGTAACACCGACGACTCTGAAATCGCGAAAAAAAACGGTAAAGACGACTTCATCCTGTCTCCGGTTGAACCGTTCTTCGACTCTCGTAAAGACAACGGTAACAAACTGCCGGAAAACGGTGACGACAACGGTGCGTACAACATCGCGCGTAAAGGTATCGTTATCCTGAACAAAATCTCTCAGTACTCTGAAAAAAACGAAAACTGCGAAAAAATGAAATGGGGTGACCTGTACGTTTCTAACATCGACTGGGACAACTTCGTT SEQATGGAAAACTTTAAAAACTTATACCCAATAAACAAAACGTTACGTTTTGAACTGCGTCCATATGGTAAA IDACACTGGAAAACTTTAAAAAAAGCGGTTTGTTGGAGAAGGATGCATTTAAAGCGAACTCTCGCAGATCCNO:ATGCAGGCCATCATTGATGAAAAATTTAAAGAGACGATCGAAGAACGTCTGAAATACACGGAATTTAGT 
51GAGTGTGACTTAGGTAATATGACTTCTAAAGATAAGAAAATCACCGATAAGGCGGCGACCAACCTGAAGAAGCAAGTCATTTTATCTTTTGATGATGAAATCTTTAACAACTATTTGAAACCGGACAAAAACATCGATGCCTTATTTAAAAATGACCCTTCGAACCCGGTGATTAGCACATTTAAGGGCTTCACAACGTATTTTGTCAATTTTTTTGAAATTCGTAAACATATCTTCAAAGGAGAATCAAGCGGCTCTATGGCTTATCGCATTATTGATGAAAACCTGACGACCTATTTGAATAACATTGAAAAAATCAAAAAACTGCCAGAGGAATTAAAGTCTCAGTTAGAAGGCATCGACCAGATCGACAAACTCAACAACTATAACGAATTTATTACGCAGTCTGGTATCACCCACTATAATGAAATTATTGGAGGTATCAGTAAATCAGAAAATGTGAAAATCCAAGGGATTAATGAAGGCATTAACCTCTATTGCCAGAAAAATAAAGTGAAACTGCCGAGGCTGACTCCACTCTACAAAATGATCCTGTCTGACCGCGTCTCGAATAGCTTTGTCCTGGACACAATTGAAAACGATACGGAATTGATTGAGATGATAAGCGATCTGATTAACAAAACCGAAATTTCACAGGATGTAATCATGAGTGATATACAAAACATCTTTATTAAATATAAACAGCTTGGTAATCTGCCTGGAATTAGCTATTCGTCAATAGTGAACGCAATCTGTTCTGATTATGATAACAATTTTGGCGACGGTAAGCGTAAAAAGAGTTATGAAAACGATAGGAAAAAACACCTGGAAACTAACGTGTATTCTATCAACTATATCAGCGAACTGCTTACGGACACCGATGTGAGTTCAAACATTAAGATGCGGTATAAGGAGCTTGAACAGAACTACCAGGTCTGTAAGGAAAACTTCAACGCAACCAACTGGATGAACATTAAAAATATCAAACAATCCGAGAAGACCAACTTAATCAAAGATCTGCTGGATATTTTGAAGAGCATTCAACGTTTTTATGATCTGTTCGATATCGTTGATGAAGACAAGAATCCTAGTGCGGAATTTTATACATGGCTGTCTAAAAATGCGGAGAAATTGGATTTCGAATTCAATTCTGTTTATAATAAATCACGCAACTATTTGACCCGCAAACAATACAGCGACAAAAAGATAAAACTAAACTTCGACAGTCCGACATTGGCAAAGGGCTGGGACGCAAATAAGGAAATCGATAACTCTACGATAATTATGCGTAAGTTCAATAATGATCGAGGTGATTATGATTATTTCTTAGGCATTTGGAACAAAAGCACCCCGGCCAACGAAAAGATAATTCCACTGGAGGATAACGGTCTGTTCGAAAAAATGCAGTACAAATTATATCCGGATCCAAGCAAGATGCTTCCAAAGCAGTTTCTGTCTAAAATTTGGAAAGCTAAGCATCCGACCACCCCAGAATTTGACAAGAAATATAAGGAAGGCCGCCATAAGAAAGGTCCCGATTTTGAAAAAGAATTCTTGCACGAACTGATTGATTGCTTTAAACATGGCTTAGTCAATCACGATGAAAAGTATCAAGATGTTTTTGGATTCAATTTGAGAAACACAGAAGACTACAATTCCTACACTGAGTTTCTCGAAGATGTGGAACGATGTAATTATAATCTGAGCTTTAACAAAATCGCGGACACCTCGAATCTGATTAACGATGGTAAACTTTATGTTTTCCAGATCTGGAGCAAGGATTTCTCTATTGACAGCAAAGGCACCAAAAACCTGAACACCATTTACTTTGAAAGTCTCTTCAGCGAAGAAAATATGATTGAGAAAATGTTTAAACTTAGCGGTGAAGCTGAAATATTCTATCGCCCGGCAAGCCTGAACTATTGCGAAGACATTATCAAAAAGGGTCATCACCACGCTGAACTGAAAGATAAATTTGATTATCCTATCATAAAAGAT
AAACGCTATAGCCAGGATAAATTTTTTTTTCATGTTCCTATGGTCATTAACTACAAATCAGAAAAACTGAACTCTAAAAGCCTCAATAATCGAACCAATGAAAACCTTGGGCAGTTTACCCATATAATTGGAATTGATCGCGGAGAGCGTCATTTAATCTACCTGACCGTAGTCGATGTATCGACCGGCGAGATCGTCGAGCAGAAGCACTTAGACGAGATTATCAACACTGATACCAAAGGTGTTGAGCATAAGACGCACTATCTAAACAAGCTGGAGGAAAAATCGAAAACCCGTGATAATGAACGTAAGAGTTGGGAGGCAATTGAAACGATTAAAGAACTGAAGGAGGGTTATATCAGCCACGTAATCAATGAAATTCAAAAACTGCAGGAAAAATACAACGCCCTGATCGTTATGGAAAATCTGAATTACGGTTTCAAAAATTCTCGCATCAAAGTGGAAAAACAGGTATATCAGAAGTTCGAGACGGCATTAATTAAAAAGTTTAATTACATCATTGACAAAAAAGATCCGGAAACTTATATTCATGGCTATCAGCTGACGAACCCGATCACCACACTGGATAAAATTGGTAACCAGTCTGGTATCGTGCTTTACATCCCTGCCTGGAATACCAGTAAAATCGATCCGGTAACGGGATTCGTCAACCTTCTATATGCAGATGACCTCAAATATAAGAATCAGGAACAGGCCAAGTCTTTTATTCAGAAAATCGATAACATTTACTTTGAGAATGGGGAATTCAAATTTGATATTGATTTTTCTAAATGGAACAATCGTTATAGTATATCTAAGACGAAATGGACGCTCACCTCGTACGGAACCCGAATCCAGACATTCCGCAATCCGCAGAAGAACAATAAATGGGACAGCGCCGAGTATGATCTCACTGAAGAATTCAAATTGATTCTGAACATTGACGGTACCCTGAAAAGCCAGGATGTCGAAACCTATAAAAAATTTATGTCTCTGTTCAAGCTGATGCTGCAACTTAGGAACTCTGTTACCGGCACTGATATCGATTATATGATCTCCCCTGTCACTGATAAAACAGGTACGCATTTCGATTCGCGCGAAAATATCAAAAATCTGCCCGCAGATGCCGACGCCAATGGGGCGTACAATATTGCACGCAAGGGTATCATGGCGATCGAAAACATTATGAATGGTATCAGCGACCCGCTGAAAATCTCAAACGAAGATTATTTGAAATATATCCAAAACCAGCAGGAATAA SEQATGACCCAGTTCGAAGGTTTCACCAACCTGTACCAGGTTTCTAAAACCCTGCGTTTCGAACTGATCCCGCID AGGGTAAAACCCTGAAACACATCCAGGAACAGGGTTTCATCGAAGAAGACAAAGCGCGTAACGACCACNO:TACAAAGAACTGAAACCGATCATCGACCGTATCTACAAAACCTACGCGGACCAGTGCCTGCAGCTGGTT 
52CAGCTGGACTGGGAAAACCTGTCTGCGGCGATCGACTCTTACCGTAAAGAAAAAACCGAAGAAACCCGTAACGCGCTGATCGAAGAACAGGCGACCTACCGTAACGCGATCCACGACTACTTCATCGGTCGTACCGACAACCTGACCGACGCGATCAACAAACGTCACGCGGAAATCTACAAAGGTCTGTTCAAAGCGGAACTGTTCAACGGTAAAGTTCTGAAACAGCTGGGTACCGTTACCACCACCGAACACGAAAACGCGCTGCTGCGTTCTTTCGACAAATTCACCACCTACTTCTCTGGTTTCTACGAAAACCGTAAAAACGTTTTCTCTGCGGAAGACATCTCTACCGCGATCCCGCACCGTATCGTTCAGGACAACTTCCCGAAATTCAAAGAAAACTGCCACATCTTCACCCGTCTGATCACCGCGGTTCCGTCTCTGCGTGAACACTTCGAAAACGTTAAAAAAGCGATCGGTATCTTCGTTTCTACCTCTATCGAAGAAGTTTTCTCTTTCCCGTTCTACAACCAGCTGCTGACCCAGACCCAGATCGACCTGTACAACCAGCTGCTGGGTGGTATCTCTCGTGAAGCGGGTACCGAAAAAATCAAAGGTCTGAACGAAGTTCTGAACCTGGCGATCCAGAAAAACGACGAAACCGCGCACATCATCGCGTCTCTGCCGCACCGTTTCATCCCGCTGTTCAAACAGATCCTGTCTGACCGTAACACCCTGTCTTTCATCCTGGAAGAATTCAAATCTGACGAAGAAGTTATCCAGTCTTTCTGCAAATACAAAACCCTGCTGCGTAACGAAAACGTTCTGGAAACCGCGGAAGCGCTGTTCAACGAACTGAACTCTATCGACCTGACCCACATCTTCATCTCTCACAAAAAACTGGAAACCATCTCTTCTGCGCTGTGCGACCACTGGGACACCCTGCGTAACGCGCTGTACGAACGTCGTATCTCTGAACTGACCGGTAAAATCACCAAATCTGCGAAAGAAAAAGTTCAGCGTTCTCTGAAACACGAAGACATCAACCTGCAGGAAATCATCTCTGCGGCGGGTAAAGAACTGTCTGAAGCGTTCAAACAGAAAACCTCTGAAATCCTGTCTCACGCGCACGCGGCGCTGGACCAGCCGCTGCCGACCACCCTGAAAAAACAGGAAGAAAAAGAAATCCTGAAATCTCAGCTGGACTCTCTGCTGGGTCTGTACCACCTGCTGGACTGGTTCGCGGTTGACGAATCTAACGAAGTTGACCCGGAATTCTCTGCGCGTCTGACCGGTATCAAACTGGAAATGGAACCGTCTCTGTCTTTCTACAACAAAGCGCGTAACTACGCGACCAAAAAACCGTACTCTGTTGAAAAATTCAAACTGAACTTCCAGATGCCGACCCTGGCGTCTGGTTGGGACGTTAACAAAGAAAAAAACAACGGTGCGATCCTGTTCGTTAAAAACGGTCTGTACTACCTGGGTATCATGCCGAAACAGAAAGGTCGTTACAAAGCGCTGTCTTTCGAACCGACCGAAAAAACCTCTGAAGGTTTCGACAAAATGTACTACGACTACTTCCCGGACGCGGCGAAAATGATCCCGAAATGCTCTACCCAGCTGAAAGCGGTTACCGCGCACTTCCAGACCCACACCACCCCGATCCTGCTGTCTAACAACTTCATCGAACCGCTGGAAATCACCAAAGAAATCTACGACCTGAACAACCCGGAAAAAGAACCGAAAAAATTCCAGACCGCGTACGCGAAAAAAACCGGTGACCAGAAAGGTTACCGTGAAGCGCTGTGCAAATGGATCGACTTCACCCGTGACTTCCTGTCTAAATACACCAAAACCACCTCTATCGACCTGTCTTCTCTGCGTCCGTCTTCTCAGTACAAAGACCTGGGTGAATACTACGCGGAACTGAACCCGCTGCTGTACCACATCTCTTTCCAGCGTATCGCGGAAAAAGAAATCATGGACGCGGTT
GAAACCGGTAAACTGTACCTGTTCCAGATCTACAACAAAGACTTCGCGAAAGGTCACCACGGTAAACCGAACCTGCACACCCTGTACTGGACCGGTCTGTTCTCTCCGGAAAACCTGGCGAAAACCTCTATCAAACTGAACGGTCAGGCGGAACTGTTCTACCGTCCGAAATCTCGTATGAAACGTATGGCGCACCGTCTGGGTGAAAAAATGCTGAACAAAAAACTGAAAGACCAGAAAACCCCGATCCCGGACACCCTGTACCAGGAACTGTACGACTACGTTAACCACCGTCTGTCTCACGACCTGTCTGACGAAGCGCGTGCGCTGCTGCCGAACGTTATCACCAAAGAAGTTTCTCACGAAATCATCAAAGACCGTCGTTTCACCTCTGACAAATTCTTCTTCCACGTTCCGATCACCCTGAACTACCAGGCGGCGAACTCTCCGTCTAAATTCAACCAGCGTGTTAACGCGTACCTGAAAGAACACCCGGAAACCCCGATCATCGGTATCGACCGTGGTGAACGTAACCTGATCTACATCACCGTTATCGACTCTACCGGTAAAATCCTGGAACAGCGTTCTCTGAACACCATCCAGCAGTTCGACTACCAGAAAAAACTGGACAACCGTGAAAAAGAACGTGTTGCGGCGCGTCAGGCGTGGTCTGTTGTTGGTACCATCAAAGACCTGAAACAGGGTTACCTGTCTCAGGTTATCCACGAAATCGTTGACCTGATGATCCACTACCAGGCGGTTGTTGTTCTGGAAAACCTGAACTTCGGTTTCAAATCTAAACGTACCGGTATCGCGGAAAAAGCGGTTTACCAGCAGTTCGAAAAAATGCTGATCGACAAACTGAACTGCCTGGTTCTGAAAGACTACCCGGCGGAAAAAGTTGGTGGTGTTCTGAACCCGTACCAGCTGACCGACCAGTTCACCTCTTTCGCGAAAATGGGTACCCAGTCTGGTTTCCTGTTCTACGTTCCGGCGCCGTACACCTCTAAAATCGACCCGCTGACCGGTTTCGTTGACCCGTTCGTTTGGAAAACCATCAAAAACCACGAATCTCGTAAACACTTCCTGGAAGGTTTCGACTTCCTGCACTACGACGTTAAAACCGGTGACTTCATCCTGCACTTCAAAATGAACCGTAACCTGTCTTTCCAGCGTGGTCTGCCGGGTTTCATGCCGGCGTGGGACATCGTTTTCGAAAAAAACGAAACCCAGTTCGACGCGAAAGGTACCCCGTTCATCGCGGGTAAACGTATCGTTCCGGTTATCGAAAACCACCGTTTCACCGGTCGTTACCGTGACCTGTACCCGGCGAACGAACTGATCGCGCTGCTGGAAGAAAAAGGTATCGTTTTCCGTGACGGTTCTAACATCCTGCCGAAACTGCTGGAAAACGACGACTCTCACGCGATCGACACCATGGTTGCGCTGATCCGTTCTGTTCTGCAGATGCGTAACTCTAACGCGGCGACCGGTGAAGACTACATCAACTCTCCGGTTCGTGACCTGAACGGTGTTTGCTTCGACTCTCGTTTCCAGAACCCGGAATGGCCGATGGACGCGGACGCGAACGGTGCGTACCACATCGCGCTGAAAGGTCAGCTGCTGCTGAACCACCTGAAAGAATCTAAAGACCTGAAACTGCAGAACGGTATCTCTAACCAGGACTGGCTGGCGTACATCCAGGAACTGCGTAACTA SEQATGGCGGTTAAATCTATCAAAGTTAAACTGCGTCTGGACGACATGCCGGAAATCCGTGCGGGTCTGTGG IDAAACTGCACAAAGAAGTTAACGCGGGTGTTCGTTACTACACCGAATGGCTGTCTCTGCTGCGTCAGGAANO: AACCTGTACCGTCGTTCTCCGAACGGTGACGGTGAACAGGAATGCGACAAAACCGCGGAAGAATGCAA53 
AGCGGAACTGCTGGAACGTCTGCGTGCGCGTCAGGTTGAAAACGGTCACCGTGGTCCGGCGGGTTCTGACGACGAACTGCTGCAGCTGGCGCGTCAGCTGTACGAACTGCTGGTTCCGCAGGCGATCGGTGCGAAAGGTGACGCGCAGCAGATCGCGCGTAAATTCCTGTCTCCGCTGGCGGACAAAGACGCGGTTGGTGGTCTGGGTATCGCGAAAGCGGGTAACAAACCGCGTTGGGTTCGTATGCGTGAAGCGGGTGAACCGGGTTGGGAAGAAGAAAAAGAAAAAGCGGAAACCCGTAAATCTGCGGACCGTACCGCGGACGTTCTGCGTGCGCTGGCGGACTTCGGTCTGAAACCGCTGATGCGTGTTTACACCGACTCTGAAATGTCTTCTGTTGAATGGAAACCGCTGCGTAAAGGTCAGGCGGTTCGTACCTGGGACCGTGACATGTTCCAGCAGGCGATCGAACGTATGATGTCTTGGGAATCTTGGAACCAGCGTGTTGGTCAGGAATACGCGAAACTGGTTGAACAGAAAAACCGTTTCGAACAGAAAAACTTCGTTGGTCAGGAACACCTGGTTCACCTGGTTAACCAGCTGCAGCAGGACATGAAAGAAGCGTCTCCGGGTCTGGAATCTAAAGAACAGACCGCGCACTACGTTACCGGTCGTGCGCTGCGTGGTTCTGACAAAGTTTTCGAAAAATGGGGTAAACTGGCGCCGGACGCGCCGTTCGACCTGTACGACGCGGAAATCAAAAACGTTCAGCGTCGTAACACCCGTCGTTTCGGTTCTCACGACCTGTTCGCGAAACTGGCGGAACCGGAATACCAGGCGCTGTGGCGTGAAGACGCGTCTTTCCTGACCCGTTACGCGGTTTACAACTCTATCCTGCGTAAACTGAACCACGCGAAAATGTTCGCGACCTTCACCCTGCCGGACGCGACCGCGCACCCGATCTGGACCCGTTTCGACAAACTGGGTGGTAACCTGCACCAGTACACCTTCCTGTTCAACGAATTCGGTGAACGTCGTCACGCGATCCGTTTCCACAAACTGCTGAAAGTTGAAAACGGTGTTGCGCGTGAAGTTGACGACGTTACCGTTCCGATCTCTATGTCTGAACAGCTGGACAACCTGCTGCCGCGTGACCCGAACGAACCGATCGCGCTGTACTTCCGTGACTACGGTGCGGAACAGCACTTCACCGGTGAATTCGGTGGTGCGAAAATCCAGTGCCGTCGTGACCAGCTGGCGCACATGCACCGTCGTCGTGGTGCGCGTGACGTTTACCTGAACGTTTCTGTTCGTGTTCAGTCTCAGTCTGAAGCGCGTGGTGAACGTCGTCCGCCGTACGCGGCGGTTTTCCGTCTGGTTGGTGACAACCACCGTGCGTTCGTTCACTTCGACAAACTGTCTGACTACCTGGCGGAACACCCGGACGACGGTAAACTGGGTTCTGAAGGTCTGCTGTCTGGTCTGCGTGTTATGTCTGTTGACCTGGGTCTGCGTACCTCTGCGTCTATCTCTGTTTTCCGTGTTGCGCGTAAAGACGAACTGAAACCGAACTCTAAAGGTCGTGTTCCGTTCTTCTTCCCGATCAAAGGTAACGACAACCTGGTTGCGGTTCACGAACGTTCTCAGCTGCTGAAACTGCCGGGTGAAACCGAATCTAAAGACCTGCGTGCGATCCGTGAAGAACGTCAGCGTACCCTGCGTCAGCTGCGTACCCAGCTGGCGTACCTGCGTCTGCTGGTTCGTTGCGGTTCTGAAGACGTTGGTCGTCGTGAACGTTCTTGGGCGAAACTGATCGAACAGCCGGTTGACGCGGCGAACCACATGACCCCGGACTGGCGTGAAGCGTTCGAAAACGAACTGCAGAAACTGAAATCTCTGCACGGTATCTGCTCTGACAAAGAATGGATGGACGCGGTTTACGAATCTGTTCGTCGTGTTTGGCGTCACATGGGTAAACAGGTTCGTGACTGGCGTAAAG
ACGTTCGTTCTGGTGAACGTCCGAAAATCCGTGGTTACGCGAAAGACGTTGTTGGTGGTAACTCTATCGAACAGATCGAATACCTGGAACGTCAGTACAAATTCCTGAAATCTTGGTCTTTCTTCGGTAAAGTTTCTGGTCAGGTTATCCGTGCGGAAAAAGGTTCTCGTTTCGCGATCACCCTGCGTGAACACATCGACCACGCGAAAGAAGACCGTCTGAAAAAACTGGCGGACCGTATCATCATGGAAGCGCTGGGTTACGTTTACGCGCTGGACGAACGTGGTAAAGGTAAATGGGTTGCGAAATACCCGCCGTGCCAGCTGATCCTGCTGGAAGAACTGTCTGAATACCAGTTCAACAACGACCGTCCGCCGTCTGAAAACAACCAGCTGATGCAGTGGTCTCACCGTGGTGTTTTCCAGGAACTGATCAACCAGGCGCAGGTTCACGACCTGCTGGTTGGTACCATGTACGCGGCGTTCTCTTCTCGTTTCGACGCGCGTACCGGTGCGCCGGGTATCCGTTGCCGTCGTGTTCCGGCGCGTTGCACCCAGGAACACAACCCGGAACCGTTCCCGTGGTGGCTGAACAAATTCGTTGTTGAACACACCCTGGACGCGTGCCCGCTGCGTGCGGACGACCTGATCCCGACCGGTGAAGGTGAAATCTTCGTTTCTCCGTTCTCTGCGGAAGAAGGTGACTTCCACCAGATCCACGCGGACCTGAACGCGGCGCAGAACCTGCAGCAGCGTCTGTGGTCTGACTTCGACATCTCTCAGATCCGTCTGCGTTGCGACTGGGGTGAAGTTGACGGTGAACTGGTTCTGATCCCGCGTCTGACCGGTAAACGTACCGCGGACTCTTACTCTAACAAAGTTTTCTACACCAACACCGGTGTTACCTACTACGAACGTGAACGTGGTAAAAAACGTCGTAAAGTTTTCGCGCAGGAAAAACTGTCTGAAGAAGAAGCGGAACTGCTGGTTGAAGCGGACGAAGCGCGTGAAAAATCTGTTGTTCTGATGCGTGACCCGTCTGGTATCATCAACCGTGGTAACTGGACCCGTCAGAAAGAATTCTGGTCTATGGTTAACCAGCGTATCGAAGGTTACCTGGTTAAACAGATCCGTTCTCGTGTTCCGCTGCAGGACTCTGCGTGCGAAAACACCGGTGACATCTAA SEQATGGCGACCCGTTCTTTCATCCTGAAAATCGAACCGAACGAAGAAGTTAAAAAAGGTCTGTGGAAAACC IDCACGAAGTTCTGAACCACGGTATCGCGTACTACATGAACATCCTGAAACTGATCCGTCAGGAAGCGATCNO: 
TACGAACACCACGAACAGGACCCGAAAAACCCGAAAAAAGTTTCTAAAGCGGAAATCCAGGCGGAACT54GTGGGACTTCGTTCTGAAAATGCAGAAATGCAACTCTTTCACCCACGAAGTTGACAAAGACGTTGTTTTCAACATCCTGCGTGAACTGTACGAAGAACTGGTTCCGTCTTCTGTTGAAAAAAAAGGTGAAGCGAACCAGCTGTCTAACAAATTCCTGTACCCGCTGGTTGACCCGAACTCTCAGTCTGGTAAAGGTACCGCGTCTTCTGGTCGTAAACCGCGTTGGTACAACCTGAAAATCGCGGGTGACCCGTCTTGGGAAGAAGAAAAAAAAAAATGGGAAGAAGACAAAAAAAAAGACCCGCTGGCGAAAATCCTGGGTAAACTGGCGGAATACGGTCTGATCCCGCTGTTCATCCCGTTCACCGACTCTAACGAACCGATCGTTAAAGAAATCAAATGGATGGAAAAATCTCGTAACCAGTCTGTTCGTCGTCTGGACAAAGACATGTTCATCCAGGCGCTGGAACGTTTCCTGTCTTGGGAATCTTGGAACCTGAAAGTTAAAGAAGAATACGAAAAAGTTGAAAAAGAACACAAAACCCTGGAAGAACGTATCAAAGAAGACATCCAGGCGTTCAAATCTCTGGAACAGTACGAAAAAGAACGTCAGGAACAGCTGCTGCGTGACACCCTGAACACCAACGAATACCGTCTGTCTAAACGTGGTCTGCGTGGTTGGCGTGAAATCATCCAGAAATGGCTGAAAATGGACGAAAACGAACCGTCTGAAAAATACCTGGAAGTTTTCAAAGACTACCAGCGTAAACACCCGCGTGAAGCGGGTGACTACTCTGTTTACGAATTCCTGTCTAAAAAAGAAAACCACTTCATCTGGCGTAACCACCCGGAATACCCGTACCTGTACGCGACCTTCTGCGAAATCGACAAAAAAAAAAAAGACGCGAAACAGCAGGCGACCTTCACCCTGGCGGACCCGATCAACCACCCGCTGTGGGTTCGTTTCGAAGAACGTTCTGGTTCTAACCTGAACAAATACCGTATCCTGACCGAACAGCTGCACACCGAAAAACTGAAAAAAAAACTGACCGTTCAGCTGGACCGTCTGATCTACCCGACCGAATCTGGTGGTTGGGAAGAAAAAGGTAAAGTTGACATCGTTCTGCTGCCGTCTCGTCAGTTCTACAACCAGATCTTCCTGGACATCGAAGAAAAAGGTAAACACGCGTTCACCTACAAAGACGAATCTATCAAATTCCCGCTGAAAGGTACCCTGGGTGGTGCGCGTGTTCAGTTCGACCGTGACCACCTGCGTCGTTACCCGCACAAAGTTGAATCTGGTAACGTTGGTCGTATCTACTTCAACATGACCGTTAACATCGAACCGACCGAATCTCCGGTTTCTAAATCTCTGAAAATCCACCGTGACGACTTCCCGAAATTCGTTAACTTCAAACCGAAAGAACTGACCGAATGGATCAAAGACTCTAAAGGTAAAAAACTGAAATCTGGTATCGAATCTCTGGAAATCGGTCTGCGTGTTATGTCTATCGACCTGGGTCAGCGTCAGGCGGCGGCGGCGTCTATCTTCGAAGTTGTTGACCAGAAACCGGACATCGAAGGTAAACTGTTCTTCCCGATCAAAGGTACCGAACTGTACGCGGTTCACCGTGCGTCTTTCAACATCAAACTGCCGGGTGAAACCCTGGTTAAATCTCGTGAAGTTCTGCGTAAAGCGCGTGAAGACAACCTGAAACTGATGAACCAGAAACTGAACTTCCTGCGTAACGTTCTGCACTTCCAGCAGTTCGAAGACATCACCGAACGTGAAAAACGTGTTACCAAATGGATCTCTCGTCAGGAAAACTCTGACGTTCCGCTGGTTTACCAGGACGAACTGATCCAGATCCGTGAACTGATGTACAAACCGTACAAAGACTGGGTTGCGTTCCTGAAACAGCTGCACAAA
CGTCTGGAAGTTGAAATCGGTAAAGAAGTTAAACACTGGCGTAAATCTCTGTCTGACGGTCGTAAAGGTCTGTACGGTATCTCTCTGAAAAACATCGACGAAATCGACCGTACCCGTAAATTCCTGCTGCGTTGGTCTCTGCGTCCGACCGAACCGGGTGAAGTTCGTCGTCTGGAACCGGGTCAGCGTTTCGCGATCGACCAGCTGAACCACCTGAACGCGCTGAAAGAAGACCGTCTGAAAAAAATGGCGAACACCATCATCATGCACGCGCTGGGTTACTGCTACGACGTTCGTAAAAAAAAATGGCAGGCGAAAAACCCGGCGTGCCAGATCATCCTGTTCGAAGACCTGTCTAACTACAACCCGTACGAAGAACGTTCTCGTTTCGAAAACTCTAAACTGATGAAATGGTCTCGTCGTGAAATCCCGCGTCAGGTTGCGCTGCAGGGTGAAATCTACGGTCTGCAGGTTGGTGAAGTTGGTGCGCAGTTCTCTTCTCGTTTCCACGCGAAAACCGGTTCTCCGGGTATCCGTTGCTCTGTTGTTACCAAAGAAAAACTGCAGGACAACCGTTTCTTCAAAAACCTGCAGCGTGAAGGTCGTCTGACCCTGGACAAAATCGCGGTTCTGAAAGAAGGTGACCTGTACCCGGACAAAGGTGGTGAAAAATTCATCTCTCTGTCTAAAGACCGTAAACTGGTTACCACCCACGCGGACATCAACGCGGCGCAGAACCTGCAGAAACGTTTCTGGACCCGTACCCACGGTTTCTACAAAGTTTACTGCAAAGCGTACCAGGTTGACGGTCAGACCGTTTACATCCCGGAATCTAAAGACCAGAAACAGAAAATCATCGAAGAATTCGGTGAAGGTTACTTCATCCTGAAAGACGGTGTTTACGAATGGGGTAACGCGGGTAAACTGAAAATCAAAAAAGGTTCTTCTAAACAGTCTTCTTCTGAACTGGTTGACTCTGACATCCTGAAAGACTCTTTCGACCTGGCGTCTGAACTGAAAGGTGAAAAACTGATGCTGTACCGTGACCCGTCTGGTAACGTTTTCCCGTCTGACAAATGGATGGCGGCGGGTGTTTTCTTCGGTAAACTGGAACGTATCCTGATCTCTAAACTGACCAACCAGTACTCTATCTCTACCATCGAAGACGACTCTTCTAAACAGTCTATGTAA SEQATGCCGACCCGTACCATCAACCTGAAACTGGTTCTGGGTAAAAACCCGGAAAACGCGACCCTGCGTCGT IDGCGCTGTTCTCTACCCACCGTCTGGTTAACCAGGCGACCAAACGTATCGAAGAATTCCTGCTGCTGTGCCNO: GTGGTGAAGCGTACCGTACCGTTGACAACGAAGGTAAAGAAGCGGAAATCCCGCGTCACGCGGTTCAG55 
GAAGAAGCGCTGGCGTTCGCGAAAGCGGCGCAGCGTCACAACGGTTGCATCTCTACCTACGAAGACCAGGAAATCCTGGACGTTCTGCGTCAGCTGTACGAACGTCTGGTTCCGTCTGTTAACGAAAACAACGAAGCGGGTGACGCGCAGGCGGCGAACGCGTGGGTTTCTCCGCTGATGTCTGCGGAATCTGAAGGTGGTCTGTCTGTTTACGACAAAGTTCTGGACCCGCCGCCGGTTTGGATGAAACTGAAAGAAGAAAAAGCGCCGGGTTGGGAAGCGGCGTCTCAGATCTGGATCCAGTCTGACGAAGGTCAGTCTCTGCTGAACAAACCGGGTTCTCCGCCGCGTTGGATCCGTAAACTGCGTTCTGGTCAGCCGTGGCAGGACGACTTCGTTTCTGACCAGAAAAAAAAACAGGACGAACTGACCAAAGGTAACGCGCCGCTGATCAAACAGCTGAAAGAAATGGGTCTGCTGCCGCTGGTTAACCCGTTCTTCCGTCACCTGCTGGACCCGGAAGGTAAAGGTGTTTCTCCGTGGGACCGTCTGGCGGTTCGTGCGGCGGTTGCGCACTTCATCTCTTGGGAATCTTGGAACCACCGTACCCGTGCGGAATACAACTCTCTGAAACTGCGTCGTGACGAATTCGAAGCGGCGTCTGACGAATTCAAAGACGACTTCACCCTGCTGCGTCAGTACGAAGCGAAACGTCACTCTACCCTGAAATCTATCGCGCTGGCGGACGACTCTAACCCGTACCGTATCGGTGTTCGTTCTCTGCGTGCGTGGAACCGTGTTCGTGAAGAATGGATCGACAAAGGTGCGACCGAAGAACAGCGTGTTACCATCCTGTCTAAACTGCAGACCCAGCTGCGTGGTAAATTCGGTGACCCGGACCTGTTCAACTGGCTGGCGCAGGACCGTCACGTTCACCTGTGGTCTCCGCGTGACTCTGTTACCCCGCTGGTTCGTATCAACGCGGTTGACAAAGTTCTGCGTCGTCGTAAACCGTACGCGCTGATGACCTTCGCGCACCCGCGTTTCCACCCGCGTTGGATCCTGTACGAAGCGCCGGGTGGTTCTAACCTGCGTCAGTACGCGCTGGACTGCACCGAAAACGCGCTGCACATCACCCTGCCGCTGCTGGTTGACGACGCGCACGGTACCTGGATCGAAAAAAAAATCCGTGTTCCGCTGGCGCCGTCTGGTCAGATCCAGGACCTGACCCTGGAAAAACTGGAAAAAAAAAAAAACCGTCTGTACTACCGTTCTGGTTTCCAGCAGTTCGCGGGTCTGGCGGGTGGTGCGGAAGTTCTGTTCCACCGTCCGTACATGGAACACGACGAACGTTCTGAAGAATCTCTGCTGGAACGTCCGGGTGCGGTTTGGTTCAAACTGACCCTGGACGTTGCGACCCAGGCGCCGCCGAACTGGCTGGACGGTAAAGGTCGTGTTCGTACCCCGCCGGAAGTTCACCACTTCAAAACCGCGCTGTCTAACAAATCTAAACACACCCGTACCCTGCAGCCGGGTCTGCGTGTTCTGTCTGTTGACCTGGGTATGCGTACCTTCGCGTCTTGCTCTGTTTTCGAACTGATCGAAGGTAAACCGGAAACCGGTCGTGCGTTCCCGGTTGCGGACGAACGTTCTATGGACTCTCCGAACAAACTGTGGGCGAAACACGAACGTTCTTTCAAACTGACCCTGCCGGGTGAAACCCCGTCTCGTAAAGAAGAAGAAGAACGTTCTATCGCGCGTGCGGAAATCTACGCGCTGAAACGTGACATCCAGCGTCTGAAATCTCTGCTGCGTCTGGGTGAAGAAGACAACGACAACCGTCGTGACGCGCTGCTGGAACAGTTCTTCAAAGGTTGGGGTGAAGAAGACGTTGTTCCGGGTCAGGCGTTCCCGCGTTCTCTGTTCCAGGGTCTGGGTGCGGCGCCGTTCCGTTCTACCCCGGAACTGTGGCGTCAGCACTGCCAGACCTACTACGACAA
AGCGGAAGCGTGCCTGGCGAAACACATCTCTGACTGGCGTAAACGTACCCGTCCGCGTCCGACCTCTCGTGAAATGTGGTACAAAACCCGTTCTTACCACGGTGGTAAATCTATCTGGATGCTGGAATACCTGGACGCGGTTCGTAAACTGCTGCTGTCTTGGTCTCTGCGTGGTCGTACCTACGGTGCGATCAACCGTCAGGACACCGCGCGTTTCGGTTCTCTGGCGTCTCGTCTGCTGCACCACATCAACTCTCTGAAAGAAGACCGTATCAAAACCGGTGCGGACTCTATCGTTCAGGCGGCGCGTGGTTACATCCCGCTGCCGCACGGTAAAGGTTGGGAACAGCGTTACGAACCGTGCCAGCTGATCCTGTTCGAAGACCTGGCGCGTTACCGTTTCCGTGTTGACCGTCCGCGTCGTGAAAACTCTCAGCTGATGCAGTGGAACCACCGTGCGATCGTTGCGGAAACCACCATGCAGGCGGAACTGTACGGTCAGATCGTTGAAAACACCGCGGCGGGTTTCTCTTCTCGTTTCCACGCGGCGACCGGTGCGCCGGGTGTTCGTTGCCGTTTCCTGCTGGAACGTGACTTCGACAACGACCTGCCGAAACCGTACCTGCTGCGTGAACTGTCTTGGATGCTGGGTAACACCAAAGTTGAATCTGAAGAAGAAAAACTGCGTCTGCTGTCTGAAAAAATCCGTCCGGGTTCTCTGGTTCCGTGGGACGGTGGTGAACAGTTCGCGACCCTGCACCCGAAACGTCAGACCCTGTGCGTTATCCACGCGGACATGAACGCGGCGCAGAACCTGCAGCGTCGTTTCTTCGGTCGTTGCGGTGAAGCGTTCCGTCTGGTTTGCCAGCCGCACGGTGACGACGTTCTGCGTCTGGCGTCTACCCCGGGTGCGCGTCTGCTGGGTGCGCTGCAGCAGCTGGAAAACGGTCAGGGTGCGTTCGAACTGGTTCGTGACATGGGTTCTACCTCTCAGATGAACCGTTTCGTTATGAAATCTCTGGGTAAAAAAAAAATCAAACCGCTGCAGGACAACAACGGTGACGACGAACTGGAAGACGTTCTGTCTGTTCTGCCGGAAGAAGACGACACCGGTCGTATCACCGTTTTCCGTGACTCTTCTGGTATCTTCTTCCCGTGCAACGTTTGGATCCCGGCGAAACAGTTCTGGCCGGCGGTTCGTGCGATGATCTGGAAAGTTATGGCGTCTCACTCTCTGGGTTAA SEQATGACCAAACTGCGTCACCGTCAGAAAAAACTGACCCACGACTGGGCGGGTTCTAAAAAACGTGAAGTT IDCTGGGTTCTAACGGTAAACTGCAGAACCCGCTGCTGATGCCGGTTAAAAAAGGTCAGGTTACCGAATTCNO:CGTAAAGCGTTCTCTGCGTACGCGCGTGCGACCAAAGGTGAAATGACCGACGGTCGTAAAAACATGTTC 
56ACCCACTCTTTCGAACCGTTCAAAACCAAACCGTCTCTGCACCAGTGCGAACTGGCGGACAAAGCGTACCAGTCTCTGCACTCTTACCTGCCGGGTTCTCTGGCGCACTTCCTGCTGTCTGCGCACGCGCTGGGTTTCCGTATCTTCTCTAAATCTGGTGAAGCGACCGCGTTCCAGGCGTCTTCTAAAATCGAAGCGTACGAATCTAAACTGGCGTCTGAACTGGCGTGCGTTGACCTGTCTATCCAGAACCTGACCATCTCTACCCTGTTCAACGCGCTGACCACCTCTGTTCGTGGTAAAGGTGAAGAAACCTCTGCGGACCCGCTGATCGCGCGTTTCTACACCCTGCTGACCGGTAAACCGCTGTCTCGTGACACCCAGGGTCCGGAACGTGACCTGGCGGAAGTTATCTCTCGTAAAATCGCGTCTTCTTTCGGTACCTGGAAAGAAATGACCGCGAACCCGCTGCAGTCTCTGCAGTTCTTCGAAGAAGAACTGCACGCGCTGGACGCGAACGTTTCTCTGTCTCCGGCGTTCGACGTTCTGATCAAAATGAACGACCTGCAGGGTGACCTGAAAAACCGTACCATCGTTTTCGACCCGGACGCGCCGGTTTTCGAATACAACGCGGAAGACCCGGCGGACATCATCATCAAACTGACCGCGCGTTACGCGAAAGAAGCGGTTATCAAAAACCAGAACGTTGGTAACTACGTTAAAAACGCGATCACCACCACCAACGCGAACGGTCTGGGTTGGCTGCTGAACAAAGGTCTGTCTCTGCTGCCGGTTTCTACCGACGACGAACTGCTGGAATTCATCGGTGTTGAACGTTCTCACCCGTCTTGCCACGCGCTGATCGAACTGATCGCGCAGCTGGAAGCGCCGGAACTGTTCGAAAAAAACGTTTTCTCTGACACCCGTTCTGAAGTTCAGGGTATGATCGACTCTGCGGTTTCTAACCACATCGCGCGTCTGTCTTCTTCTCGTAACTCTCTGTCTATGGACTCTGAAGAACTGGAACGTCTGATCAAATCTTTCCAGATCCACACCCCGCACTGCTCTCTGTTCATCGGTGCGCAGTCTCTGTCTCAGCAGCTGGAATCTCTGCCGGAAGCGCTGCAGTCTGGTGTTAACTCTGCGGACATCCTGCTGGGTTCTACCCAGTACATGCTGACCAACTCTCTGGTTGAAGAATCTATCGCGACCTACCAGCGTACCCTGAACCGTATCAACTACCTGTCTGGTGTTGCGGGTCAGATCAACGGTGCGATCAAACGTAAAGCGATCGACGGTGAAAAAATCCACCTGCCGGCGGCGTGGTCTGAACTGATCTCTCTGCCGTTCATCGGTCAGCCGGTTATCGACGTTGAATCTGACCTGGCGCACCTGAAAAACCAGTACCAGACCCTGTCTAACGAATTCGACACCCTGATCTCTGCGCTGCAGAAAAACTTCGACCTGAACTTCAACAAAGCGCTGCTGAACCGTACCCAGCACTTCGAAGCGATGTGCCGTTCTACCAAAAAAAACGCGCTGTCTAAACCGGAAATCGTTTCTTACCGTGACCTGCTGGCGCGTCTGACCTCTTGCCTGTACCGTGGTTCTCTGGTTCTGCGTCGTGCGGGTATCGAAGTTCTGAAAAAACACAAAATCTTCGAATCTAACTCTGAACTGCGTGAACACGTTCACGAACGTAAACACTTCGTTTTCGTTTCTCCGCTGGACCGTAAAGCGAAAAAACTGCTGCGTCTGACCGACTCTCGTCCGGACCTGCTGCACGTTATCGACGAAATCCTGCAGCACGACAACCTGGAAAACAAAGACCGTGAATCTCTGTGGCTGGTTCGTTCTGGTTACCTGCTGGCGGGTCTGCCGGACCAGCTGTCTTCTTCTTTCATCAACCTGCCGATCATCACCCAGAAAGGTGACCGTCGTCTGATCGACCTGATCCAGTACGACCAGATCAACCGTGACGCGTTCGTTATGCTGGTT
ACCTCTGCGTTCAAATCTAACCTGTCTGGTCTGCAGTACCGTGCGAACAAACAGTCTTTCGTTGTTACCCGTACCCTGTCTCCGTACCTGGGTTCTAAACTGGTTTACGTTCCGAAAGACAAAGACTGGCTGGTTCCGTCTCAGATGTTCGAAGGTCGTTTCGCGGACATCCTGCAGTCTGACTACATGGTTTGGAAAGACGCGGGTCGTCTGTGCGTTATCGACACCGCGAAACACCTGTCTAACATCAAAAAATCTGTTTTCTCTTCTGAAGAAGTTCTGGCGTTCCTGCGTGAACTGCCGCACCGTACCTTCATCCAGACCGAAGTTCGTGGTCTGGGTGTTAACGTTGACGGTATCGCGTTCAACAACGGTGACATCCCGTCTCTGAAAACCTTCTCTAACTGCGTTCAGGTTAAAGTTTCTCGTACCAACACCTCTCTGGTTCAGACCCTGAACCGTTGGTTCGAAGGTGGTAAAGTTTCTCCGCCGTCTATCCAGTTCGAACGTGCGTACTACAAAAAAGACGACCAGATCCACGAAGACGCGGCGAAACGTAAAATCCGTTTCCAGATGCCGGCGACCGAACTGGTTCACGCGTCTGACGACGCGGGTTGGACCCCGTCTTACCTGCTGGGTATCGACCCGGGTGAATACGGTATGGGTCTGTCTCTGGTTTCTATCAACAACGGTGAAGTTCTGGACTCTGGTTTCATCCACATCAACTCTCTGATCAACTTCGCGTCTAAAAAATCTAACCACCAGACCAAAGTTGTTCCGCGTCAGCAGTACAAATCTCCGTACGCGAACTACCTGGAACAGTCTAAAGACTCTGCGGCGGGTGACATCGCGCACATCCTGGACCGTCTGATCTACAAACTGAACGCGCTGCCGGTTTTCGAAGCGCTGTCTGGTAACTCTCAGTCTGCGGCGGACCAGGTTTGGACCAAAGTTCTGTCTTTCTACACCTGGGGTGACAACGACGCGCAGAACTCTATCCGTAAACAGCACTGGTTCGGTGCGTCTCACTGGGACATCAAAGGTATGCTGCGTCAGCCGCCGACCGAAAAAAAACCGAAACCGTACATCGCGTTCCCGGGTTCTCAGGTTTCTTCTTACGGTAACTCTCAGCGTTGCTCTTGCTGCGGTCGTAACCCGATCGAACAGCTGCGTGAAATGGCGAAAGACACCTCTATCAAAGAACTGAAAATCCGTAACTCTGAAATCCAGCTGTTCGACGGTACCATCAAACTGTTCAACCCGGACCCGTCTACCGTTATCGAACGTCGTCGTCACAACCTGGGTCCGTCTCGTATCCCGGTTGCGGACCGTACCTTCAAAAACATCTCTCCGTCTTCTCTGGAATTCAAAGAACTGATCACCATCGTTTCTCGTTCTATCCGTCACTCTCCGGAATTCATCGCGAAAAAACGTGGTATCGGTTCTGAATACTTCTGCGCGTACTCTGACTGCAACTCTTCTCTGAACTCTGAAGCGAACGCGGCGGCGAACGTTGCGCAGAAATTCCAGAAACAGCTGTTCTTCGAACTGTAASEQATGAAACGTATCCTGAACTCTCTGAAAGTTGCGGCGCTGCGTCTGCTGTTCCGTGGTAAAGGTTCTGAACID TGGTTAAAACCGTTAAATACCCGCTGGTTTCTCCGGTTCAGGGTGCGGTTGAAGAACTGGCGGAAGCGANO:TCCGTCACGACAACCTGCACCTGTTCGGTCAGAAAGAAATCGTTGACCTGATGGAAAAAGACGAAGGTA 
57CCCAGGTTTACTCTGTTGTTGACTTCTGGCTGGACACCCTGCGTCTGGGTATGTTCTTCTCTCCGTCTGCGAACGCGCTGAAAATCACCCTGGGTAAATTCAACTCTGACCAGGTTTCTCCGTTCCGTAAAGTTCTGGAACAGTCTCCGTTCTTCCTGGCGGGTCGTCTGAAAGTTGAACCGGCGGAACGTATCCTGTCTGTTGAAATCCGTAAAATCGGTAAACGTGAAAACCGTGTTGAAAACTACGCGGCGGACGTTGAAACCTGCTTCATCGGTCAGCTGTCTTCTGACGAAAAACAGTCTATCCAGAAACTGGCGAACGACATCTGGGACTCTAAAGACCACGAAGAACAGCGTATGCTGAAAGCGGACTTCTTCGCGATCCCGCTGATCAAAGACCCGAAAGCGGTTACCGAAGAAGACCCGGAAAACGAAACCGCGGGTAAACAGAAACCGCTGGAACTGTGCGTTTGCCTGGTTCCGGAACTGTACACCCGTGGTTTCGGTTCTATCGCGGACTTCCTGGTTCAGCGTCTGACCCTGCTGCGTGACAAAATGTCTACCGACACCGCGGAAGACTGCCTGGAATACGTTGGTATCGAAGAAGAAAAAGGTAACGGTATGAACTCTCTGCTGGGTACCTTCCTGAAAAACCTGCAGGGTGACGGTTTCGAACAGATCTTCCAGTTCATGCTGGGTTCTTACGTTGGTTGGCAGGGTAAAGAAGACGTTCTGCGTGAACGTCTGGACCTGCTGGCGGAAAAAGTTAAACGTCTGCCGAAACCGAAATTCGCGGGTGAATGGTCTGGTCACCGTATGTTCCTGCACGGTCAGCTGAAATCTTGGTCTTCTAACTTCTTCCGTCTGTTCAACGAAACCCGTGAACTGCTGGAATCTATCAAATCTGACATCCAGCACGCGACCATGCTGATCTCTTACGTTGAAGAAAAAGGTGGTTACCACCCGCAGCTGCTGTCTCAGTACCGTAAACTGATGGAACAGCTGCCGGCGCTGCGTACCAAAGTTCTGGACCCGGAAATCGAAATGACCCACATGTCTGAAGCGGTTCGTTCTTACATCATGATCCACAAATCTGTTGCGGGTTTCCTGCCGGACCTGCTGGAATCTCTGGACCGTGACAAAGACCGTGAATTCCTGCTGTCTATCTTCCCGCGTATCCCGAAAATCGACAAAAAAACCAAAGAAATCGTTGCGTGGGAACTGCCGGGTGAACCGGAAGAAGGTTACCTGTTCACCGCGAACAACCTGTTCCGTAACTTCCTGGAAAACCCGAAACACGTTCCGCGTTTCATGGCGGAACGTATCCCGGAAGACTGGACCCGTCTGCGTTCTGCGCCGGTTTGGTTCGACGGTATGGTTAAACAGTGGCAGAAAGTTGTTAACCAGCTGGTTGAATCTCCGGGTGCGCTGTACCAGTTCAACGAATCTTTCCTGCGTCAGCGTCTGCAGGCGATGCTGACCGTTTACAAACGTGACCTGCAGACCGAAAAATTCCTGAAACTGCTGGCGGACGTTTGCCGTCCGCTGGTTGACTTCTTCGGTCTGGGTGGTAACGACATCATCTTCAAATCTTGCCAGGACCCGCGTAAACAGTGGCAGACCGTTATCCCGCTGTCTGTTCCGGCGGACGTTTACACCGCGTGCGAAGGTCTGGCGATCCGTCTGCGTGAAACCCTGGGTTTCGAATGGAAAAACCTGAAAGGTCACGAACGTGAAGACTTCCTGCGTCTGCACCAGCTGCTGGGTAACCTGCTGTTCTGGATCCGTGACGCGAAACTGGTTGTTAAACTGGAAGACTGGATGAACAACCCGTGCGTTCAGGAATACGTTGAAGCGCGTAAAGCGATCGACCTGCCGCTGGAAATCTTCGGTTTCGAAGTTCCGATCTTCCTGAACGGTTACCTGTTCTCTGAACTGCGTCAGCTGGAACTGCTGCTGCGTCGTAAATCTGTTATGACCTCTTACTCTGTTAAAA
CCACCGGTTCTCCGAACCGTCTGTTCCAGCTGGTTTACCTGCCGCTGAACCCGTCTGACCCGGAAAAAAAAAACTCTAACAACTTCCAGGAACGTCTGGACACCCCGACCGGTCTGTCTCGTCGTTTCCTGGACCTGACCCTGGACGCGTTCGCGGGTAAACTGCTGACCGACCCGGTTACCCAGGAACTGAAAACCATGGCGGGTTTCTACGACCACCTGTTCGGTTTCAAACTGCCGTGCAAACTGGCGGCGATGTCTAACCACCCGGGTTCTTCTTCTAAAATGGTTGTTCTGGCGAAACCGAAAAAAGGTGTTGCGTCTAACATCGGTTTCGAACCGATCCCGGACCCGGCGCACCCGGTTTTCCGTGTTCGTTCTTCTTGGCCGGAACTGAAATACCTGGAAGGTCTGCTGTACCTGCCGGAAGACACCCCGCTGACCATCGAACTGGCGGAAACCTCTGTTTCTTGCCAGTCTGTTTCTTCTGTTGCGTTCGACCTGAAAAACCTGACCACCATCCTGGGTCGTGTTGGTGAATTCCGTGTTACCGCGGACCAGCCGTTCAAACTGACCCCGATCATCCCGGAAAAAGAAGAATCTTTCATCGGTAAAACCTACCTGGGTCTGGACGCGGGTGAACGTTCTGGTGTTGGTTTCGCGATCGTTACCGTTGACGGTGACGGTTACGAAGTTCAGCGTCTGGGTGTTCACGAAGACACCCAGCTGATGGCGCTGCAGCAGGTTGCGTCTAAATCTCTGAAAGAACCGGTTTTCCAGCCGCTGCGTAAAGGTACCTTCCGTCAGCAGGAACGTATCCGTAAATCTCTGCGTGGTTGCTACTGGAACTTCTACCACGCGCTGATGATCAAATACCGTGCGAAAGTTGTTCACGAAGAATCTGTTGGTTCTTCTGGTCTGGTTGGTCAGTGGCTGCGTGCGTTCCAGAAAGACCTGAAAAAAGCGGACGTTCTGCCGAAAAAAGGTGGTAAAAACGGTGTTGACAAAAAAAAACGTGAATCTTCTGCGCAGGACACCCTGTGGGGTGGTGCGTTCTCTAAAAAAGAAGAACAGCAGATCGCGTTCGAAGTTCAGGCGGCGGGTTCTTCTCAGTTCTGCCTGAAATGCGGTTGGTGGTTCCAGCTGGGTATGCGTGAAGTTAACCGTGTTCAGGAATCTGGTGTTGTTCTGGACTGGAACCGTTCTATCGTTACCTTCCTGATCGAATCTTCTGGTGAAAAAGTTTACGGTTTCTCTCCGCAGCAGCTGGAAAAAGGTTTCCGTCCGGACATCGAAACCTTCAAAAAAATGGTTCGTGACTTCATGCGTCCGCCGATGTTCGACCGTAAAGGTCGTCCGGCGGCGGCGTACGAACGTTTCGTTCTGGGTCGTCGTCACCGTCGTTACCGTTTCGACAAAGTTTTCGAAGAACGTTTCGGTCGTTCTGCGCTGTTCATCTGCCCGCGTGTTGGTTGCGGTAACTTCGACCACTCTTCTGAACAGTCTGCGGTTGTTCTGGCGCTGATCGGTTACATCGCGGACAAAGAAGGTATGTCTGGTAAAAAACTGGTTTACGTTCGTCTGGCGGAACTGATGGCGGAATGGAAACTGAAAAAACTGGAACGTTCTCGTGTTGAAGAACAGTCTTCTGCGCAGTAA SEQATGGCGGAATCTAAACAGATGCAGTGCCGTAAATGCGGTGCGTCTATGAAATACGAAGTTATCGGTCTG IDGGTAAAAAATCTTGCCGTTACATGTGCCCGGACTGCGGTAACCACACCTCTGCGCGTAAAATCCAGAACNO:AAAAAAAAACGTGACAAAAAATACGGTTCTGCGTCTAAAGCGCAGTCTCAGCGTATCGCGGTTGCGGGT 
58GCGCTGTACCCGGACAAAAAAGTTCAGACCATCAAAACCTACAAATACCCGGCGGACCTGAACGGTGAAGTTCACGACTCTGGTGTTGCGGAAAAAATCGCGCAGGCGATCCAGGAAGACGAAATCGGTCTGCTGGGTCCGTCTTCTGAATACGCGTGCTGGATCGCGTCTCAGAAACAGTCTGAACCGTACTCTGTTGTTGACTTCTGGTTCGACGCGGTTTGCGCGGGTGGTGTTTTCGCGTACTCTGGTGCGCGTCTGCTGTCTACCGTTCTGCAGCTGTCTGGTGAAGAATCTGTTCTGCGTGCGGCGCTGGCGTCTTCTCCGTTCGTTGACGACATCAACCTGGCGCAGGCGGAAAAATTCCTGGCGGTTTCTCGTCGTACCGGTCAGGACAAACTGGGTAAACGTATCGGTGAATGCTTCGCGGAAGGTCGTCTGGAAGCGCTGGGTATCAAAGACCGTATGCGTGAATTCGTTCAGGCGATCGACGTTGCGCAGACCGCGGGTCAGCGTTTCGCGGCGAAACTGAAAATCTTCGGTATCTCTCAGATGCCGGAAGCGAAACAGTGGAACAACGACTCTGGTCTGACCGTTTGCATCCTGCCGGACTACTACGTTCCGGAAGAAAACCGTGCGGACCAGCTGGTTGTTCTGCTGCGTCGTCTGCGTGAAATCGCGTACTGCATGGGTATCGAAGACGAAGCGGGTTTCGAACACCTGGGTATCGACCCGGGTGCGCTGTCTAACTTCTCTAACGGTAACCCGAAACGTGGTTTCCTGGGTCGTCTGCTGAACAACGACATCATCGCGCTGGCGAACAACATGTCTGCGATGACCCCGTACTGGGAAGGTCGTAAAGGTGAACTGATCGAACGTCTGGCGTGGCTGAAACACCGTGCGGAAGGTCTGTACCTGAAAGAACCGCACTTCGGTAACTCTTGGGCGGACCACCGTTCTCGTATCTTCTCTCGTATCGCGGGTTGGCTGTCTGGTTGCGCGGGTAAACTGAAAATCGCGAAAGACCAGATCTCTGGTGTTCGTACCGACCTGTTCCTGCTGAAACGTCTGCTGGACGCGGTTCCGCAGTCTGCGCCGTCTCCGGACTTCATCGCGTCTATCTCTGCGCTGGACCGTTTCCTGGAAGCGGCGGAATCTTCTCAGGACCCGGCGGAACAGGTTCGTGCGCTGTACGCGTTCCACCTGAACGCGCCGGCGGTTCGTTCTATCGCGAACAAAGCGGTTCAGCGTTCTGACTCTCAGGAATGGCTGATCAAAGAACTGGACGCGGTTGACCACCTGGAATTCAACAAAGCGTTCCCGTTCTTCTCTGACACCGGTAAAAAAAAAAAAAAAGGTGCGAACTCTAACGGTGCGCCGTCTGAAGAAGAATACACCGAAACCGAATCTATCCAGCAGCCGGAAGACGCGGAACAGGAAGTTAACGGTCAGGAAGGTAACGGTGCGTCTAAAAACCAGAAAAAATTCCAGCGTATCCCGCGTTTCTTCGGTGAAGGTTCTCGTTCTGAATACCGTATCCTGACCGAAGCGCCGCAGTACTTCGACATGTTCTGCAACAACATGCGTGCGATCTTCATGCAGCTGGAATCTCAGCCGCGTAAAGCGCCGCGTGACTTCAAATGCTTCCTGCAGAACCGTCTGCAGAAACTGTACAAACAGACCTTCCTGAACGCGCGTTCTAACAAATGCCGTGCGCTGCTGGAATCTGTTCTGATCTCTTGGGGTGAATTCTACACCTACGGTGCGAACGAAAAAAAATTCCGTCTGCGTCACGAAGCGTCTGAACGTTCTTCTGACCCGGACTACGTTGTTCAGCAGGCGCTGGAAATCGCGCGTCGTCTGTTCCTGTTCGGTTTCGAATGGCGTGACTGCTCTGCGGGTGAACGTGTTGACCTGGTTGAAATCCACAAAAAAGCGATCTCTTTCCTGCTGGCGATCACCCAGGCGGAAGTTTCTGTTGGTTCTTACAACTGGCTG
GGTAACTCTACCGTTTCTCGTTACCTGTCTGTTGCGGGTACCGACACCCTGTACGGTACCCAGCTGGAAGAATTCCTGAACGCGACCGTTCTGTCTCAGATGCGTGGTCTGGCGATCCGTCTGTCTTCTCAGGAACTGAAAGACGGTTTCGACGTTCAGCTGGAATCTTCTTGCCAGGACAACCTGCAGCACCTGCTGGTTTACCGTGCGTCTCGTGACCTGGCGGCGTGCAAACGTGCGACCTGCCCGGCGGAACTGGACCCGAAAATCCTGGTTCTGCCGGTTGGTGCGTTCATCGCGTCTGTTATGAAAATGATCGAACGTGGTGACGAACCGCTGGCGGGTGCGTACCTGCGTCACCGTCCGCACTCTTTCGGTTGGCAGATCCGTGTTCGTGGTGTTGCGGAAGTTGGTATGGACCAGGGTACCGCGCTGGCGTTCCAGAAACCGACCGAATCTGAACCGTTCAAAATCAAACCGTTCTCTGCGCAGTACGGTCCGGTTCTGTGGCTGAACTCTTCTTCTTACTCTCAGTCTCAGTACCTGGACGGTTTCCTGTCTCAGCCGAAAAACTGGTCTATGCGTGTTCTGCCGCAGGCGGGTTCTGTTCGTGTTGAACAGCGTGTTGCGCTGATCTGGAACCTGCAGGCGGGTAAAATGCGTCTGGAACGTTCTGGTGCGCGTGCGTTCTTCATGCCGGTTCCGTTCTCTTTCCGTCCGTCTGGTTCTGGTGACGAAGCGGTTCTGGCGCCGAACCGTTACCTGGGTCTGTTCCCGCACTCTGGTGGTATCGAATACGCGGTTGTTGACGTTCTGGACTCTGCGGGTTTCAAAATCCTGGAACGTGGTACCATCGCGGTTAACGGTTTCTCTCAGAAACGTGGTGAACGTCAGGAAGAAGCGCACCGTGAAAAACAGCGTCGTGGTATCTCTGACATCGGTCGTAAAAAACCGGTTCAGGCGGAAGTTGACGCGGCGAACGAACTGCACCGTAAATACACCGACGTTGCGACCCGTCTGGGTTGCCGTATCGTTGTTCAGTGGGCGCCGCAGCCGAAACCGGGTACCGCGCCGACCGCGCAGACCGTTTACGCGCGTGCGGTTCGTACCGAAGCGCCGCGTTCTGGTAACCAGGAAGACCACGCGCGTATGAAATCTTCTTGGGGTTACACCTGGGGTACCTACTGGGAAAAACGTAAACCGGAAGACATCCTGGGTATCTCTACCCAGGTTTACTGGACCGGTGGTATCGGTGAATCTTGCCCGGCGGTTGCGGTTGCGCTGCTGGGTCACATCCGTGCGACCTCTACCCAGACCGAATGGGAAAAAGAAGAAGTTGTTTTCGGTCGTCTGAAAAAATTCTTCCCGTCTTAA SEQATGGAAAAACGTATCAACAAAATCCGTAAAAAACTGTCTGCGGACAACGCGACCAAACCGGTTTCTCGT IDTCTGGTCCGATGAAAACCCTGCTGGTTCGTGTTATGACCGACGACCTGAAAAAACGTCTGGAAAAACGTNO:CGTAAAAAACCGGAAGTTATGCCGCAGGTTATCTCTAACAACGCGGCGAACAACCTGCGTATGCTGCTG 
59GACGACTACACCAAAATGAAAGAAGCGATCCTGCAGGTTTACTGGCAGGAATTCAAAGACGACCACGTTGGTCTGATGTGCAAATTCGCGCAGCCGGCGTCTAAAAAAATCGACCAGAACAAACTGAAACCGGAAATGGACGAAAAAGGTAACCTGACCACCGCGGGTTTCGCGTGCTCTCAGTGCGGTCAGCCGCTGTTCGTTTACAAACTGGAACAGGTTTCTGAAAAAGGTAAAGCGTACACCAACTACTTCGGTCGTTGCAACGTTGCGGAACACGAAAAACTGATCCTGCTGGCGCAGCTGAAACCGGAAAAAGACTCTGACGAAGCGGTTACCTACTCTCTGGGTAAATTCGGTCAGCGTGCGCTGGACTTCTACTCTATCCACGTTACCAAAGAATCTACCCACCCGGTTAAACCGCTGGCGCAGATCGCGGGTAACCGTTACGCGTCTGGTCCGGTTGGTAAAGCGCTGTCTGACGCGTGCATGGGTACCATCGCGTCTTTCCTGTCTAAATACCAGGACATCATCATCGAACACCAGAAAGTTGTTAAAGGTAACCAGAAACGTCTGGAATCTCTGCGTGAACTGGCGGGTAAAGAAAACCTGGAATACCCGTCTGTTACCCTGCCGCCGCAGCCGCACACCAAAGAAGGTGTTGACGCGTACAACGAAGTTATCGCGCGTGTTCGTATGTGGGTTAACCTGAACCTGTGGCAGAAACTGAAACTGTCTCGTGACGACGCGAAACCGCTGCTGCGTCTGAAAGGTTTCCCGTCTTTCCCGGTTGTTGAACGTCGTGAAAACGAAGTTGACTGGTGGAACACCATCAACGAAGTTAAAAAACTGATCGACGCGAAACGTGACATGGGTCGTGTTTTCTGGTCTGGTGTTACCGCGGAAAAACGTAACACCATCCTGGAAGGTTACAACTACCTGCCGAACGAAAACGACCACAAAAAACGTGAAGGTTCTCTGGAAAACCCGAAAAAACCGGCGAAACGTCAGTTCGGTGACCTGCTGCTGTACCTGGAAAAAAAATACGCGGGTGACTGGGGTAAAGTTTTCGACGAAGCGTGGGAACGTATCGACAAAAAAATCGCGGGTCTGACCTCTCACATCGAACGTGAAGAAGCGCGTAACGCGGAAGACGCGCAGTCTAAAGCGGTTCTGACCGACTGGCTGCGTGCGAAAGCGTCTTTCGTTCTGGAACGTCTGAAAGAAATGGACGAAAAAGAATTCTACGCGTGCGAAATCCAGCTGCAGAAATGGTACGGTGACCTGCGTGGTAACCCGTTCGCGGTTGAAGCGGAAAACCGTGTTGTTGACATCTCTGGTTTCTCTATCGGTTCTGACGGTCACTCTATCCAGTACCGTAACCTGCTGGCGTGGAAATACCTGGAAAACGGTAAACGTGAATTCTACCTGCTGATGAACTACGGTAAAAAAGGTCGTATCCGTTTCACCGACGGTACCGACATCAAAAAATCTGGTAAATGGCAGGGTCTGCTGTACGGTGGTGGTAAAGCGAAAGTTATCGACCTGACCTTCGACCCGGACGACGAACAGCTGATCATCCTGCCGCTGGCGTTCGGTACCCGTCAGGGTCGTGAATTCATCTGGAACGACCTGCTGTCTCTGGAAACCGGTCTGATCAAACTGGCGAACGGTCGTGTTATCGAAAAAACCATCTACAACAAAAAAATCGGTCGTGACGAACCGGCGCTGTTCGTTGCGCTGACCTTCGAACGTCGTGAAGTTGTTGACCCGTCTAACATCAAACCGGTTAACCTGATCGGTGTTGACCGTGGTGAAAACATCCCGGCGGTTATCGCGCTGACCGACCCGGAAGGTTGCCCGCTGCCGGAATTCAAAGACTCTTCTGGTGGTCCGACCGACATCCTGCGTATCGGTGAAGGTTACAAAGAAAAACAGCGTGCGATCCAGGCGGCGAAAGAAGTTGAACAGCGTCGTGCGGGTGGTTACTCTCGTAAA
TTCGCGTCTAAATCTCGTAACCTGGCGGACGACATGGTTCGTAACTCTGCGCGTGACCTGTTCTACCACGCGGTTACCCACGACGCGGTTCTGGTTTTCGAAAACCTGTCTCGTGGTTTCGGTCGTCAGGGTAAACGTACCTTCATGACCGAACGTCAGTACACCAAAATGGAAGACTGGCTGACCGCGAAACTGGCGTACGAAGGTCTGACCTCTAAAACCTACCTGTCTAAAACCCTGGCGCAGTACACCTCTAAAACCTGCTCTAACTGCGGTTTCACCATCACCACCGCGGACTACGACGGTATGCTGGTTCGTCTGAAAAAAACCTCTGACGGTTGGGCGACCACCCTGAACAACAAAGAACTGAAAGCGGAAGGTCAGATCACCTACTACAACCGTTACAAACGTCAGACCGTTGAAAAAGAACTGTCTGCGGAACTGGACCGTCTGTCTGAAGAATCTGGTAACAACGACATCTCTAAATGGACCAAAGGTCGTCGTGACGAAGCGCTGTTCCTGCTGAAAAAACGTTTCTCTCACCGTCCGGTTCAGGAACAGTTCGTTTGCCTGGACTGCGGTCACGAAGTTCACGCGGACGAACAGGCGGCGCTGAACATCGCGCGTTCTTGGCTGTTCCTGAACTCTAACTCTACCGAATTCAAATCTTACAAATCTGGTAAACAGCCGTTCGTTGGTGCGTGGCAGGCGTTCTACAAACGTCGTCTGAAAGAAGTTTGGAAACCGAACGCG SEQATGAAACGTATCAACAAAATCCGTCGTCGTCTGGTTAAAGACTCTAACACCAAAAAAGCGGGTAAAACC IDGGTCCGATGAAAACCCTGCTGGTTCGTGTTATGACCCCGGACCTGCGTGAACGTCTGGAAAACCTGCGTNO:AAAAAACCGGAAAACATCCCGCAGCCGATCTCTAACACCTCTCGTGCGAACCTGAACAAACTGCTGACC 60GACTACACCGAAATGAAAAAAGCGATCCTGCACGTTTACTGGGAAGAATTCCAGAAAGACCCGGTTGGTCTGATGTCTCGTGTTGCGCAGCCGGCGCCGAAAAACATCGACCAGCGTAAACTGATCCCGGTTAAAGACGGTAACGAACGTCTGACCTCTTCTGGTTTCGCGTGCTCTCAGTGCTGCCAGCCGCTGTACGTTTACAAACTGGAACAGGTTAACGACAAAGGTAAACCGCACACCAACTACTTCGGTCGTTGCAACGTTTCTGAACACGAACGTCTGATCCTGCTGTCTCCGCACAAACCGGAAGCGAACGACGAACTGGTTACCTACTCTCTGGGTAAATTCGGTCAGCGTGCGCTGGACTTCTACTCTATCCACGTTACCCGTGAATCTAACCACCCGGTTAAACCGCTGGAACAGATCGGTGGTAACTCTTGCGCGTCTGGTCCGGTTGGTAAAGCGCTGTCTGACGCGTGCATGGGTGCGGTTGCGTCTTTCCTGACCAAATACCAGGACATCATCCTGGAACACCAGAAAGTTATCAAAAAAAACGAAAAACGTCTGGCGAACCTGAAAGACATCGCGTCTGCGAACGGTCTGGCGTTCCCGAAAATCACCCTGCCGCCGCAGCCGCACACCAAAGAAGGTATCGAAGCGTACAACAACGTTGTTGCGCAGATCGTTATCTGGGTTAACCTGAACCTGTGGCAGAAACTGAAAATCGGTCGTGACGAAGCGAAACCGCTGCAGCGTCTGAAAGGTTTCCCGTCTTTCCCGCTGGTTGAACGTCAGGCGAACGAAGTTGACTGGTGGGACATGGTTTGCAACGTTAAAAAACTGATCAACGAAAAAAAAGAAGACGGTAAAGTTTTCTGGCAGAACCTGGCGGGTTACAAACGTCAGGAAGCGCTGCTGCCGTACCTGTCTTCTGAAGAAGACCGTAAAAAAGGTAAAAAATTCGCGCGTTACCAGTTCGGTGACCTGCTGCTGCACCTGGAAAAAAAACACGGTGAAGACTGGG
GTAAAGTTTACGACGAAGCGTGGGAACGTATCGACAAAAAAGTTGAAGGTCTGTCTAAACACATCAAACTGGAAGAAGAACGTCGTTCTGAAGACGCGCAGTCTAAAGCGGCGCTGACCGACTGGCTGCGTGCGAAAGCGTCTTTCGTTATCGAAGGTCTGAAAGAAGCGGACAAAGACGAATTCTGCCGTTGCGAACTGAAACTGCAGAAATGGTACGGTGACCTGCGTGGTAAACCGTTCGCGATCGAAGCGGAAAACTCTATCCTGGACATCTCTGGTTTCTCTAAACAGTACAACTGCGCGTTCATCTGGCAGAAAGACGGTGTTAAAAAACTGAACCTGTACCTGATCATCAACTACTTCAAAGGTGGTAAACTGCGTTTCAAAAAAATCAAACCGGAAGCGTTCGAAGCGAACCGTTTCTACACCGTTATCAACAAAAAATCTGGTGAAATCGTTCCGATGGAAGTTAACTTCAACTTCGACGACCCGAACCTGATCATCCTGCCGCTGGCGTTCGGTAAACGTCAGGGTCGTGAATTCATCTGGAACGACCTGCTGTCTCTGGAAACCGGTTCTCTGAAACTGGCGAACGGTCGTGTTATCGAAAAAACCCTGTACAACCGTCGTACCCGTCAGGACGAACCGGCGCTGTTCGTTGCGCTGACCTTCGAACGTCGTGAAGTTCTGGACTCTTCTAACATCAAACCGATGAACCTGATCGGTATCGACCGTGGTGAAAACATCCCGGCGGTTATCGCGCTGACCGACCCGGAAGGTTGCCCGCTGTCTCGTTTCAAAGACTCTCTGGGTAACCCGACCCACATCCTGCGTATCGGTGAATCTTACAAAGAAAAACAGCGTACCATCCAGGCGGCGAAAGAAGTTGAACAGCGTCGTGCGGGTGGTTACTCTCGTAAATACGCGTCTAAAGCGAAAAACCTGGCGGACGACATGGTTCGTAACACCGCGCGTGACCTGCTGTACTACGCGGTTACCCAGGACGCGATGCTGATCTTCGAAAACCTGTCTCGTGGTTTCGGTCGTCAGGGTAAACGTACCTTCATGGCGGAACGTCAGTACACCCGTATGGAAGACTGGCTGACCGCGAAACTGGCGTACGAAGGTCTGCCGTCTAAAACCTACCTGTCTAAAACCCTGGCGCAGTACACCTCTAAAACCTGCTCTAACTGCGGTTTCACCATCACCTCTGCGGACTACGACCGTGTTCTGGAAAAACTGAAAAAAACCGCGACCGGTTGGATGACCACCATCAACGGTAAAGAACTGAAAGTTGAAGGTCAGATCACCTACTACAACCGTTACAAACGTCAGAACGTTGTTAAAGACCTGTCTGTTGAACTGGACCGTCTGTCTGAAGAATCTGTTAACAACGACATCTCTTCTTGGACCAAAGGTCGTTCTGGTGAAGCGCTGTCTCTGCTGAAAAAACGTTTCTCTCACCGTCCGGTTCAGGAAAAATTCGTTTGCCTGAACTGCGGTTTCGAAACCCACGCGGACGAACAGGCGGCGCTGAACATCGCGCGTTCTTGGCTGTTCCTGCGTTCTCAGGAATACAAAAAATACCAGACCAACAAAACCACCGGTAACACCGACAAACGTGCGTTCGTTGAAACCTGGCAGTCTTTCTACCGTAAAAAACTGAAAGAAGTTTGGAAACCG 
SEQAAAATTCcatGCAAAATGCTCCGGTTTCATGTCATCAAAATGATGACGTAATTAAGCATTGATAATTGAGAIDTCCCTCTCCCTGACAGGATGATTACATAAATAATAGTGACAAAAATAAATTATTTATTTATCCAGAAAATNO:GAATTGGAAAATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTcaaaCAGGTtgccgtcactgc61gtcttttactggctcttctcgctaaccaaaccggtaaccccgcttattaaaagcattctgtaacaaagcgggaccaaagccatgacaaaaacgcgtaacaaaagtgtctataatcacggcagaaaagtccacattgattatttgcacggcgtcacactttgctatgccatagcatttttatccataagattagcggatcctacctgacgctttttatcgcaactctctactgtttctccatacccgtttttttgggctagcaccgcctatctcgtgtgagataggcggagatacgaactttaagAAGGAGatataccATGGGTAAAATGTATTACCTTGGTTTAGACATTGGCACGAATTCCGTGGGCTACGCGGTGACCGACCCCTCATACCACCTGCTGAAGTTTAAGGGGGAACCAATGTGGGGTGCGCACGTATTTGCCGCCGGTAATCAGAGCGCGGAACGACGCTCGTTCCGCACATCGCGTCGTCGTTTGGACCGACGCCAACAGCGCGTTAAACTGGTACAGGAGATTTTTGCCCCGGTGATTAGTCCGATCGACCCACGCTTCTTCATTCGTCTGCATGAATCCGCCCTGTGGCGCGATGACGTCGCGGAGACGGATAAACATATCTTTTTCAATGATCCTACCTATACCGATAAGGAATATTATAGCGATTACCCGACTATCCATCACCTGATCGTTGATCTGATGGAAAGCTCTGAGAAACACGATCCGCGGCTGGTGTACCTTGCAGTGGCGTGGTTAGTGGCACACCGTGGTCATTTTCTGAACGAGGTGGACAAGGATAATATTGGAGATGTGTTGTCGTTCGACGCATTTTATCCGGAGTTTCTCGCGTTCCTGTCGGACAACGGTGTATCACCGTGGGTGTGCGAAAGCAAAGCGCTGCAGGCGACCTTGCTGAGCCGTAACTCAGTGAACGACAAATATAAAGCCCTTAAGTCTCTGATCTTCGGATCCCAGAAACCTGAAGATAACTTCGATGCCAATATTTCGGAAGATGGACTCATTCAACTGCTGGCCGGCAAAAAGGTAAAAGTTAACAAACTGTTCCCTCAGGAATCGAACGATGCATCCTTCACATTGAATGATAAAGAAGACGCGATAGAAGAAATCCTGGGTACGCTTACACCAGATGAATGTGAATGGATTGCGCATATACGCCGCCTTTTTGACTGGGCTATCATGAAACATGCTCTGAAAGATGGCAGGACTATTAGCGAGTCAAAAGTCAAACTGTATGAGCAGCACCATCACGATCTGACCCAACTTAAATACTTCGTGAAAACCTACCTTGCAAAAGAATACGACGATATTTTCCGCAACGTGGATAGCGAAACAACGAAAAACTATGTAGCGTATTCCTATCATGTGAAAGAGGTGAAAGGCACTCTGCCTAAAAATAAGGCAACGCAAGAAGAGTTTTGTAAGTATGTCCTGGGCAAGGTTAAAAACATTGAATGCTCTGAAGCAGACAAGGTTGACTTTGATGAGATGATTCAGCGTCTTACCGACAACTCTTTTATGCCTAAGCAGGTTTCGGGCGAAAACCGCGTTATTCCTTATCAGTTATATTATTATGAACTGAAGACAATTCTGAATAAAGCAGCCTCGTACCTGCCTTTCCTGACGCAGTGTGGAAAAGATGCAATTTCGAACCAGGACAAACTACTGTCGATCATGACGTTCCGTATTCCTTACTTCGTCGGACCCTTGCGAAAAGATAATTCGGAACATGCATGGCTCG
AACGAAAGGCCGGTAAGATTTATCCGTGGAACTTTAACGACAAAGTGGACTTGGATAAATCAGAAGAAGCGTTCATTCGCCGAATGACCAATACCTGTACCTATTATCCCGGCGAAGATGTTTTACCGTTGGATTCGCTGATCTATGAGAAATTTATGATTTTAAATGAAATCAATAATATTCGTATTGACGGCTACCCGATTAGTGTTGACGTTAAACAGCAGGTTTTTGGCTTGTTCGAAAAAAAACGACGCGTAACCGTGAAAGATATTCAGAACCTGCTGCTGTCTCTCGGAGCTCTGGACAAACACGGGAAGCTGACAGGCATCGATACCACTATCCACTCAAACTATAATACGTATCACCATTTTAAATCTCTCATGGAACGCGGCGTCCTGACCCGGGATGACGTGGAACGCATCGTTGAAAGGATGACCTACAGCGACGATACTAAGCGTGTGCGTCTGTGGCTGAATAACAACTATGGTACTTTAACCGCCGACGATGTGAAACACATTTCGCGTCTGCGCAAACACGATTTTGGCCGTTTATCCAAAATGTTCTTAACAGGTCTGAAGGGTGTCCATAAGGAGACCGGTGAACGTGCCTCCATACTGGATTTCATGTGGAACACGAACGATAACCTGATGCAGCTCCTTTCCGAATGCTACACGTTCAGTGATGAAATCACAAAGCTGCAAGAGGCGTATTATGCAAAAGCCCAGTTGTCTTTAAACGATTTTTTAGACTCGATGTACATCTCTAACGCGGTGAAACGTCCGATTTACAGAACTCTGGCAGTGGTGAACGATATTCGAAAAGCATGTGGGACGGCCCCTAAACGCATTTTCATCGAAATGGCTCGTGATGGTGAATCAAAAAAAAAGAGAAGTGTTACACGTCGCGAGCAGATCAAAAACCTGTACCGCTCGATTCGTAAAGATTTCCAGCAGGAAGTTGATTTTCTGGAAAAGATCCTGGAAAATAAATCTGATGGTCAACTTCAGTCAGATGCTTTGTATCTTTACTTTGCACAATTAGGGCGCGATATGTACACGGGCGATCCAATAAAGCTGGAGCACATCAAAGATCAGAGTTTCTATAACATAGACCATATTTACCCGCAGTCTATGGTGAAAGACGATTCCCTAGATAACAAAGTGCTGGTGCAAAGCGAAATTAACGGCGAGAAAAGCTCGCGATACCCTTTGGACGCCGCGATCCGCAATAAAATGAAGCCCCTTTGGGACGCTTACTATAATCATGGCCTGATCTCCTTAAAGAAATACCAGCGTCTAACGCGCTCGACCCCGTTTACCGATGATGAAAAATGGGACTTTATTAATCGCCAGTTAGTGGAAACCCGTCAATCTACCAAAGCGCTGGCCATTTTGTTGAAGCGTAAGTTTCCAGACACCGAAATTGTGTATTCGAAGGCGGGGTTATCGTCCGACTTCAGACATGAATTCGGCCTTGTAAAAAGTCGCAATATTAATGATTTGCACCACGCTAAAGACGCATTCTTGGCTATCGTTACCGGCAATGTGTACCATGAAAGATTCAATCGCAGATGGTTTATGGTGAACCAGCCGTACTCAGTTAAAACTAAAACTCTTTTTACCCACAGCATAAAGAATGGCAACTTCGTTGCCTGGAACGGCGAAGAAGATCTCGGTCGTATTGTAAAAATGCTGAAGCAAAACAAAAATACCATTCACTTCACGCGCTTCTCCTTCGATCGCAAAGAAGGATTATTTGATATCCAACCTCTGAAAGCCAGCACCGGCTTAGTCCCACGAAAAGCCGGTCTGGATGTCGTTAAATACGGCGGATATGACAAATCTACCGCGGCCTATTACCTGCTGGTGAGGTTCACGCTCGAGGACAAGAAAACCCAGCACAAGCTGATGATGATTCCTGTAGAAGGCCTGTACAAGGCTCGCATTGATCATGACAAGGAATTTCTTACCGATTATGCGCAAACGACTATAAGCGAAATC
CTACAGAAAGATAAACAGAAAGTGATCAATATTATGTTTCCAATGGGTACGAGGCATATAAAACTCAATTCAATGATTAGTATCGATGGCTTCTATCTTAGTATCGGCGGAAAGTCCTCTAAAGGTAAGTCAGTTCTATGTCACGCAATGGTTCCACTGATCGTCCCTCACAAAATCGAATGTTACATTAAAGCAATGGAAAGCTTCGCCCGGAAGTTTAAAGAAAACAACAAGCTGCGCATCGTAGAAAAATTCGATAAAATCACCGTTGAAGACAACCTGAATCTCTACGAGCTCTTTCTCCAAAAACTGCAGCATAATCCCTATAATAAGTTTTTTTCGACACAGTTTGACGTACTGACGAACGGCCGTTCTACTTTCACAAAACTGTCGCCGGAGGAACAGGTACAGACGCTCTTGAACATTTTAAGTATCTTTAAAACATGCCGCAGTTCGGGTTGCGACCTGAAATCCATCAACGGCAGTGCCCAGGCAGCGCGCATCATGATTAGCGCTGACTTAACTGGACTGTCGAAAAAATATTCAGATATTAGGTTGGTTGAACAGTCAGCTTCTGGTTTGTTCGTATCCAAAAGTCAGAACTTACTGGAGTATCTCTAAGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATTTATTATATCGCGTTGATTATTGATGCTGTTTTTAGTTTTAACGGCAATTAATATATGTGTTATTAATTGAATGAATTTTATCATTCATAATAAGTATGTGTAGGATCAAGCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACAGAATTATCTCATAACAAGTGTTAAGGGATGTTATTTCC SEQAAAATTCcatGCAAAATGCTCCGGTTTCATGTCATCAAAATGATGACGTAATTAAGCATTGATAATTGAGAIDTCCCTCTCCCTGACAGGATGATTACATAAATAATAGTGACAAAAATAAATTATTTATTTATCCAGAAAATNO:GAATTGGAAAATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTcaaaCAGGTtgccgtcactgc62gtcttttactggctcttctcgctaaccaaaccggtaaccccgcttattaaaagcattctgtaacaaagcgggaccaaagccatgacaaaaacgcgtaacaaaagtgtctataatcacggcagaaaagtccacattgattatttgcacggcgtcacactttgctatgccatagcatttttatccataagattagcggatcctacctgacgctttttatcgcaactctctactgtttctccatacccgtttttttgggctagcaccgcctatctcgtgtgagataggcggagatacgaactttaagAAGGAGatataccATGTCATCGCTCACGAAATTCACTAACAAATACTCTAAACAGCTCACCATTAAGAATGAACTCATCCCAGTTGGCAAAACACTGGAGAACATCAAAGAGAATGGTCTGATAGATGGCGACGAACAGCTGAATGAGAATTATCAGAAGGCGAAAATTATTGTGGATGATTTTCTGCGGGACTTCATTAATAAAGCACTGAATAATACGCAGATCGGGAACTGGCGCGAACTGGCGGATGCCCTTAATAAAGAGGATGAAGATAACATCGAGAAATTGCAGGATAAAATTCGGGGAATCATTGTATCCAAATTTGAAACGTTTGATCTGTTTAGCAGCTATTCTATTAAGAAAGATGAAAAGATTATTGACGACGACAATGATGTTGAAGAAGAGGAACTGGATCTGGGCAAGAAGACCAGCTCATTTAAATACATATTTAAAAAAAACCTGTTTAAGTTAGTGTTGCCATCCTACCTGAAAACCACAAACCAGGACAAGCTGAAGATTATTAGCTCGTTTGATAATTTTTCAACGTACTTCCGCGGGTTCTTTGAAAACCGGAAAAACATTTTTACCAAGAAACCGA
TCTCCACAAGTATTGCGTATCGCATTGTTCATGATAACTTCCCGAAATTCCTTGATAACATTCGTTGTTTTAATGTGTGGCAGACGGAATGCCCGCAACTAATCGTGAAAGCAGATAACTATCTGAAAAGCAAAAATGTTATAGCGAAAGATAAAAGTTTGGCAAACTATTTTACCGTGGGCGCGTATGACTATTTCCTGTCTCAGAATGGTATAGATTTTTACAACAATATTATAGGTGGACTGCCAGCGTTCGCCGGCCATGAGAAAATCCAAGGTCTCAATGAATTCATCAATCAAGAGTGCCAAAAAGACAGCGAGCTGAAAAGTAAGCTGAAAAACCGTCACGCGTTCAAAATGGCGGTACTGTTCAAACAGATACTCAGCGATCGTGAAAAAAGTTTTGTAATTGATGAGTTCGAGTCGGATGCTCAAGTTATTGACGCCGTTAAAAACTTTTACGCCGAACAGTGCAAAGATAACAATGTTATTTTTAACTTATTAAATCTTATCAAGAATATCGCTTTCTTAAGTGATGACGAACTGGACGGCATATTCATTGAAGGGAAATACCTGTCGAGCGTTAGTCAAAAACTCTATAGCGATTGGTCAAAATTACGTAACGACATTGAGGATTCGGCTAACTCTAAACAAGGCAATAAAGAGCTGGCCAAGAAGATCAAAACCAACAAAGGGGATGTAGAAAAAGCGATCTCGAAATATGAGTTCTCGCTGTCGGAACTGAACTCGATTGTACATGATAACACCAAGTTTTCTGACCTCCTTAGTTGTACACTGCATAAGGTGGCTTCTGAGAAACTGGTGAAGGTCAATGAAGGCGACTGGCCGAAACATCTCAAGAATAATGAAGAGAAACAAAAAATCAAAGAGCCGCTTGATGCTCTGCTGGAGATCTATAATACACTTCTGATTTTTAACTGCAAAAGCTTCAATAAAAACGGCAACTTCTATGTCGACTATGATCGTTGCATCAATGAACTGAGTTCGGTCGTGTATCTGTATAATAAAACACGTAACTATTGCACTAAAAAACCCTATAACACGGACAAGTTCAAACTCAATTTTAACAGTCCGCAGCTCGGTGAAGGCTTTTCCAAGTCGAAAGAAAATGACTGTCTGACTCTTTTGTTTAAAAAAGACGACAACTATTATGTAGGCATTATCCGCAAAGGTGCAAAAATCAATTTTGATGATACACAAGCAATCGCCGATAACACCGACAATTGCATCTTTAAAATGAATTATTTCCTACTTAAAGACGCAAAAAAATTTATCCCGAAATGTAGCATTCAGCTGAAAGAAGTCAAGGCCCATTTTAAGAAATCTGAAGATGATTACATTTTGTCTGATAAAGAGAAATTTGCTAGCCCGCTGGTCATTAAAAAGAGCACATTTTTGCTGGCAACTGCACATGTGAAAGGGAAAAAAGGCAATATCAAGAAATTTCAGAAAGAATATTCGAAAGAAAACCCCACTGAGTATCGCAATTCTTTAAACGAATGGATTGCTTTTTGTAAAGAGTTCTTAAAAACTTATAAAGCGGCTACCATTTTTGATATAACCACATTGAAAAAGGCAGAGGAATATGCTGATATTGTAGAATTCTACAAGGATGTCGATAATCTGTGCTACAAACTGGAGTTCTGCCCGATTAAAACCTCGTTTATAGAAAACCTGATAGATAACGGCGACCTGTATCTGTTTCGCATCAATAACAAAGACTTCAGCAGTAAATCGACCGGCACCAAGAACCTTCATACGTTATATTTACAAGCTATATTCGATGAACGTAATCTGAACAATCCGACAATTATGCTGAATGGGGGAGCAGAACTGTTCTATCGTAAAGAAAGTATTGAGCAGAAAAACCGTATCACACACAAAGCCGGTTCAATTCTCGTGAATAAGGTGTGTAAAGACGGTACAAGCCTGGATGATAAGATACGTAATGAAATTTATCAATATGAGAATAAATTTATT
GATACCCTGTCTGATGAAGCTAAAAAGGTGTTACCGAATGTCATTAAAAAGGAAGCTACCCATGACATTACAAAAGATAAACGTTTCACTAGTGACAAATTCTTCTTTCACTGCCCCCTGACAATTAATTATAAGGAAGGCGATACCAAGCAGTTCAATAACGAAGTGCTGAGTTTTCTGCGTGGAAATCCTGACATCAACATTATCGGCATTGACCGCGGAGAGCGTAATTTAATCTATGTAACGGTTATAAACCAGAAAGGCGAGATTCTGGATTCGGTTTCATTCAATACCGTGACCAACAAGAGTTCAAAAATCGAGCAGACAGTCGATTATGAAGAGAAATTGGCAGTCCGCGAGAAAGAGAGGATTGAAGCAAAACGTTCCTGGGACTCTATCTCAAAAATTGCGACACTAAAGGAAGGTTATCTGAGCGCAATAGTTCACGAGATCTGTCTGTTAATGATTAAACACAACGCGATCGTTGTCTTAGAGAATCTTAATGCAGGCTTTAAGCGTATTCGTGGCGGTTTATCAGAAAAAAGTGTTTATCAAAAATTCGAAAAAATGTTGATTAACAAACTGAACTATTTTGTCAGCAAGAAGGAATCCGACTGGAATAAACCGTCTGGTCTGCTGAATGGACTGCAGCTTTCGGATCAGTTTGAAAGCTTCGAAAAACTGGGTATTCAGTCTGGTTTTATTTTTTACGTGCCGGCTGCATATACCTCAAAGATTGATCCGACCACGGGCTTCGCCAATGTTCTGAATCTGTCGAAGGTACGCAATGTTGATGCGATCAAAAGCTTTTTTTCTAACTTCAACGAAATTAGTTATAGCAAGAAAGAAGCCCTTTTCAAATTCTCATTCGATCTGGATTCACTGAGTAAGAAAGGCTTTAGTAGCTTTGTGAAATTTAGTAAGAGTAAATGGAACGTCTACACCTTTGGAGAACGTATCATAAAGCCAAAGAATAAGCAAGGTTATCGGGAGGACAAAAGAATCAACTTGACCTTCGAGATGAAGAAGTTACTTAACGAGTATAAGGTTTCTTTTGATCTTGAAAATAACTTGATTCCGAATCTCACGAGTGCCAACCTGAAGGATACTTTTTGGAAAGAGCTATTCTTTATCTTCAAGACTACGCTGCAGCTCCGTAACAGCGTTACTAACGGTAAAGAAGATGTGCTCATCTCTCCGGTCAAAAATGCGAAGGGTGAATTCTTCGTTTCGGGAACGCATAACAAGACTCTTCCGCAAGATTGCGATGCGAACGGTGCATACCATATTGCGTTGAAAGGTCTGATGATACTCGAACGTAACAACCTTGTACGTGAGGAGAAAGATACGAAAAAGATTATGGCGATTTCAAACGTGGATTGGTTCGAGTACGTGCAGAAACGTAGAGGCGTTCTGTAAGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATTTATTATATCGCGTTGATTATTGATGCTGTTTTTAGTTTTAACGGCAATTAATATATGTGTTATTAATTGAATGAATTTTATCATTCATAATAAGTATGTGTAGGATCAAGCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACAGAATTATCTCATAACAAGTGTTAAGGGATGTTATTTCC SEQAATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGTTGCCGTCACTGCGTCIDTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGCTTATTAAAAGCATTCTGTAACAAAGCGGGANO:CCAAAGCCATGACAAAAACGCGTAACAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGATTATT 
63TGCACGGCGTCACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTGACGCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAGTAATACGACTCACTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAACTTTAAGAGGAGGATATACCATGCACCATCATCATCACCATAACAACTACGACGAATTCACCAAACTGTACCCGATCCAGAAAACCATCCGTTTCGAACTGAAACCGCAGGGTCGTACCATGGAACACCTGGAAACCTTCAACTTCTTCGAAGAAGACCGTGACCGTGCGGAAAAATACAAAATCCTGAAAGAAGCGATCGACGAATACCACAAAAAATTCATCGACGAACACCTGACCAACATGTCTCTGGACTGGAACTCTCTGAAACAGATCTCTGAAAAATACTACAAATCTCGTGAAGAAAAAGACAAAAAAGTTTTCCTGTCTGAACAGAAACGTATGCGTCAGGAAATCGTTTCTGAATTCAAAAAAGACGACCGTTTCAAAGACCTGTTCTCTAAAAAACTGTTCTCTGAACTGCTGAAAGAAGAAATCTACAAAAAAGGTAACCACCAGGAAATCGACGCGCTGAAATCTTTCGACAAATTCTCTGGTTACTTCATCGGTCTGCACGAAAACCGTAAAAACATGTACTCTGACGGTGACGAAATCACCGCGATCTCTAACCGTATCGTTAACGAAAACTTCCCGAAATTCCTGGACAACCTGCAGAAATACCAGGAAGCGCGTAAAAAATACCCGGAATGGATCATCAAAGCGGAATCTGCGCTGGTTGCGCACAACATCAAAATGGACGAAGTTTTCTCTCTGGAATACTTCAACAAAGTTCTGAACCAGGAAGGTATCCAGCGTTACAACCTGGCGCTGGGTGGTTACGTTACCAAATCTGGTGAAAAAATGATGGGTCTGAACGACGCGCTGAACCTGGCGCACCAGTCTGAAAAATCTTCTAAAGGTCGTATCCACATGACCCCGCTGTTCAAACAGATCCTGTCTGAAAAAGAATCTTTCTCTTACATCCCGGACGTTTTCACCGAAGACTCTCAGCTGCTGCCGTCTATCGGTGGTTTCTTCGCGCAGATCGAAAACGACAAAGACGGTAACATCTTCGACCGTGCGCTGGAACTGATCTCTTCTTACGCGGAATACGACACCGAACGTATCTACATCCGTCAGGCGGACATCAACCGTGTTTCTAACGTTATCTTCGGTGAATGGGGTACCCTGGGTGGTCTGATGCGTGAATACAAAGCGGACTCTATCAACGACATCAACCTGGAACGTACCTGCAAAAAAGTTGACAAATGGCTGGACTCTAAAGAATTCGCGCTGTCTGACGTTCTGGAAGCGATCAAACGTACCGGTAACAACGACGCGTTCAACGAATACATCTCTAAAATGCGTACCGCGCGTGAAAAAATCGACGCGGCGCGTAAAGAAATGAAATTCATCTCTGAAAAAATCTCTGGTGACGAAGAATCTATCCACATCATCAAAACCCTGCTGGACTCTGTTCAGCAGTTCCTGCACTTCTTCAACCTGTTCAAAGCGCGTCAGGACATCCCGCTGGACGGTGCGTTCTACGCGGAATTCGACGAAGTTCACTCTAAACTGTTCGCGATCGTTCCGCTGTACAACAAAGTTCGTAACTACCTGACCAAAAACAACCTGAACACCAAAAAAATCAAACTGAACTTCAAAAACCCGACCCTGGCGAACGGTTGGGACCAGAACAAAGTTTACGACTACGCGTCTCTGATCTTCCTGCGTGACGGTAACTACTACCTGGGTATCATCAACCCGAAACGTAAAAAAAACATCAAATTCGAACAGGGTTCTGGTAACGGTCCGTTCTACCGTAAAATGGTTTACAAACAGATCCCGGGTCCGAACAAAAACCTGCCGCGTGTTTTCCTGACC
TCTACCAAAGGTAAAAAAGAATACAAACCGTCTAAAGAAATCATCGAAGGTTACGAAGCGGACAAACACATCCGTGGTGACAAATTCGACCTGGACTTCTGCCACAAACTGATCGACTTCTTCAAAGAATCTATCGAAAAACACAAAGACTGGTCTAAATTCAACTTCTACTTCTCTCCGACCGAATCTTACGGTGACATCTCTGAATTCTACCTGGACGTTGAAAAACAGGGTTACCGTATGCACTTCGAAAACATCTCTGCGGAAACCATCGACGAATACGTTGAAAAAGGTGACCTGTTCCTGTTCCAGATCTACAACAAAGACTTCGTTAAAGCGGCGACCGGTAAAAAAGACATGCACACCATCTACTGGAACGCGGCGTTCTCTCCGGAAAACCTGCAGGACGTTGTTGTTAAACTGAACGGTGAAGCGGAACTGTTCTACCGTGACAAATCTGACATCAAAGAAATCGTTCACCGTGAAGGTGAAATCCTGGTTAACCGTACCTACAACGGTCGTACCCCGGTTCCGGACAAAATCCACAAAAAACTGACCGACTACCACAACGGTCGTACCAAAGACCTGGGTGAAGCGAAAGAATACCTGGACAAAGTTCGTTACTTCAAAGCGCACTACGACATCACCAAAGACCGTCGTTACCTGAACGACAAAATCTACTTCCACGTTCCGCTGACCCTGAACTTCAAAGCGAACGGTAAAAAAAACCTGAACAAAATGGTTATCGAAAAATTCCTGTCTGACGAAAAAGCGCACATCATCGGTATCGACCGTGGTGAACGTAACCTGCTGTACTACTCTATCATCGACCGTTCTGGTAAAATCATCGACCAGCAGTCTCTGAACGTTATCGACGGTTTCGACTACCGTGAAAAACTGAACCAGCGTGAAATCGAAATGAAAGACGCGCGTCAGTCTTGGAACGCGATCGGTAAAATCAAAGACCTGAAAGAAGGTTACCTGTCTAAAGCGGTTCACGAAATCACCAAAATGGCGATCCAGTACAACGCGATCGTTGTTATGGAAGAACTGAACTACGGTTTCAAACGTGGTCGTTTCAAAGTTGAAAAACAGATCTACCAGAAATTCGAAAACATGCTGATCGACAAAATGAACTACCTGGTTTTCAAAGACGCGCCGGACGAATCTCCGGGTGGTGTTCTGAACGCGTACCAGCTGACCAACCCGCTGGAATCTTTCGCGAAACTGGGTAAACAGACCGGTATCCTGTTCTACGTTCCGGCGGCGTACACCTCTAAAATCGACCCGACCACCGGTTTCGTTAACCTGTTCAACACCTCTTCTAAAACCAACGCGCAGGAACGTAAAGAATTCCTGCAGAAATTCGAATCTATCTCTTACTCTGCGAAAGACGGTGGTATCTTCGCGTTCGCGTTCGACTACCGTAAATTCGGTACCTCTAAAACCGACCACAAAAACGTTTGGACCGCGTACACCAACGGTGAACGTATGCGTTACATCAAAGAAAAAAAACGTAACGAACTGTTCGACCCGTCTAAAGAAATCAAAGAAGCGCTGACCTCTTCTGGTATCAAATACGACGGTGGTCAGAACATCCTGCCGGACATCCTGCGTTCTAACAACAACGGTCTGATCTACACCATGTACTCTTCTTTCATCGCGGCGATCCAGATGCGTGTTTACGACGGTAAAGAAGACTACATCATCTCTCCGATCAAAAACTCTAAAGGTGAATTCTTCCGTACCGACCCGAAACGTCGTGAACTGCCGATCGACGCGGACGCGAACGGTGCGTACAACATCGCGCTGCGTGGTGAACTGACCATGCGTGCGATCGCGGAAAAATTCGACCCGGACTCTGAAAAAATGGCGAAACTGGAACTGAAACACAAAGACTGGTTCGAATTCATGCAGACCCGTGGTGACTAAGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATGTAGGGAGACCCTCAGGTTAAATA
TTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACA SEQAAAATTCcatGCAAAATGCTCCGGTTTCATGTCATCAAAATGATGACGTAATTAAGCATTGATAATTGAGAIDTCCCTCTCCCTGACAGGATGATTACATAAATAATAGTGACAAAAATAAATTATTTATTTATCCAGAAAATNO:GAATTGGAAAATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTcaaaCAGGTtgccgtcactgc64gtcttttactggctcttctcgctaaccaaaccggtaaccccgcttattaaaagcattctgtaacaaagcgggaccaaagccatgacaaaaacgcgtaacaaaagtgtctataatcacggcagaaaagtccacattgattatttgcacggcgtcacactttgctatgccatagcatttttatccataagattagcggatcctacctgacgctttttatcgcaactctctactgtttctccatacccgtttttttgggctagcaccgcctatctcgtgtgagataggcggagatacgaactttaagAAGGAGatataccATGACTAAAACATTTGATTCAGAGTTTTTTAATTTGTACTCGCTGCAAAAAACGGTACGCTTTGAGTTAAAACCCGTGGGAGAAACCGCGTCATTTGTGGAAGACTTTAAAAACGAGGGCTTGAAACGTGTTGTGAGCGAAGATGAAAGGCGAGCCGTCGATTACCAGAAAGTTAAGGAAATAATTGACGATTACCATCGGGATTTCATTGAAGAAAGTTTAAATTATTTTCCGGAACAGGTGAGTAAAGATGCTCTTGAGCAGGCGTTTCATCTTTATCAGAAACTGAAGGCAGCAAAAGTTGAGGAAAGGGAAAAAGCGCTGAAAGAATGGGAAGCGCTGCAGAAAAAGCTACGTGAAAAAGTGGTGAAATGCTTCTCGGACTCGAATAAAGCCCGCTTCTCAAGGATTGATAAAAAGGAACTGATTAAGGAAGACCTGATAAATTGGTTGGTCGCCCAGAATCGCGAGGATGATATCCCTACGGTCGAAACGTTTAACAACTTCACCACATATTTTACCGGCTTCCATGAGAATCGTAAAAATATTTACTCCAAAGATGATCACGCCACCGCTATTAGCTTTCGCCTTATTCATGAAAATCTTCCAAAGTTTTTTGACAACGTGATTAGCTTCAATAAGTTGAAAGAGGGTTTCCCTGAATTAAAATTTGATAAAGTGAAAGAGGATTTAGAAGTAGATTATGATCTGAAGCATGCGTTTGAAATAGAATATTTCGTTAACTTCGTGACCCAAGCGGGCATAGATCAGTATAATTATCTGTTAGGAGGGAAAACCCTGGAGGACGGGACGAAAAAACAAGGGATGAATGAGCAAATTAATCTGTTCAAACAACAGCAAACGCGAGATAAAGCGCGTCAGATTCCCAAACTGATCCCCCTGTTCAAACAGATTCTTAGCGAAAGGACTGAAAGCCAGTCCTTTATTCCTAAACAATTTGAAAGTGATCAGGAGTTGTTCGATTCACTGCAGAAGTTACATAATAACTGCCAGGATAAATTCACCGTGCTGCAACAAGCCATTCTCGGTCTGGCAGAGGCGGATCTTAAGAAGGTCTTCATCAAAACCTCTGATTTAAATGCCTTATCTAACACCATTTTCGGGAATTACAGCGTCTTTTCCGATGCACTGAACCTGTATAAAGAAAGCCTGAAAACGAAAAAAGCGCAGGAGGCTTTTGAGAAACTACCGGCCCATTCTATTCACGACCTCATTCAATACTTGGAACAGTTCAATTCCAGCCTGGACGCGGAAAAACAACAGAGCACCGACACCGTCCTGAACTACTTCATCAAGACCGATGAATTATATTCTCGCTTCATTAAATCCACTAGCGAGGCTTTCACTCAGGTGCAGCCTTTGTTCGAACTGGAAGCCCTGTCATCTAAGCGC
CGCCCACCGGAATCGGAAGATGAAGGGGCAAAAGGGCAGGAAGGCTTCGAGCAGATCAAGCGTATTAAAGCTTACCTGGATACGCTTATGGAAGCGGTACACTTTGCAAAGCCGTTGTATCTTGTTAAGGGTCGTAAAATGATCGAAGGGCTCGATAAAGACCAGTCCTTTTATGAAGCGTTTGAAATGGCGTACCAAGAACTTGAATCGTTAATCATTCCTATCTATAACAAAGCGCGGAGCTATCTGTCGCGGAAACCTTTCAAGGCCGATAAATTCAAGATTAATTTTGACAACAACACGCTACTGAGCGGATGGGATGCGAACAAGGAAACTGCTAACGCGTCCATTCTGTTTAAGAAAGACGGGTTATATTACCTTGGAATTATGCCGAAAGGTAAGACCTTTCTCTTTGACTACTTTGTATCGAGCGAGGATTCAGAGAAACTGAAACAGCGTCGCCAGAAGACCGCCGAAGAAGCTCTGGCGCAGGATGGTGAAAGTTACTTCGAAAAAATTCGTTATAAACTGTTACCAGGGGCTTCAAAGATGTTACCGAAAGTCTTTTTTAGCAACAAAAATATTGGCTTTTACAACCCGTCGGATGACATTTTACGCATTCGCAACACAGCCTCTCACACCAAAAACGGGACCCCTCAGAAAGGCCACTCAAAAGTTGAGTTTAACCTGAATGATTGTCATAAGATGATTGATTTCTTCAAATCATCAATTCAGAAACACCCGGAATGGGGGTCTTTTGGCTTTACGTTTTCTGATACCAGTGATTTTGAAGACATGAGTGCCTTCTACCGGGAAGTAGAAAACCAGGGTTACGTAATTAGCTTTGACAAAATCAAAGAGACCTATATACAGAGCCAGGTGGAACAGGGTAATCTCTACTTATTCCAGATTTATAACAAGGATTTCTCGCCCTACAGCAAAGGCAAACCAAACCTGCATACTCTGTACTGGAAAGCCCTGTTTGAAGAAGCGAACCTGAATAACGTAGTGGCGAAGTTGAACGGTGAAGCGGAAATCTTCTTCCGTCGTCACTCCATTAAGGCCTCTGATAAAGTTGTCCATCCGGCAAATCAGGCCATTGATAATAAGAATCCACACACGGAAAAAACGCAGTCAACCTTTGAATATGACCTCGTTAAAGACAAACGCTACACGCAAGATAAGTTCTTTTTCCACGTCCCAATCAGCCTCAACTTTAAAGCACAAGGGGTTTCAAAGTTTAATGATAAAGTCAATGGGTTCCTCAAGGGCAACCCGGATGTCAACATTATAGGTATAGACAGGGGCGAACGCCATCTGCTTTACTTTACCGTAGTGAATCAGAAAGGTGAAATACTGGTTCAGGAATCATTAAATACCTTGATGTCGGACAAAGGGCACGTTAATGATTACCAGCAGAAACTGGATAAAAAAGAACAGGAACGTGATGCTGCGCGTAAATCGTGGACCACGGTTGAGAACATTAAAGAGCTGAAAGAGGGGTATCTAAGCCATGTGGTACACAAACTGGCGCACCTCATCATTAAATATAACGCAATAGTCTGCCTAGAAGACTTGAATTTTGGCTTTAAACGCGGCCGCTTCAAAGTGGAAAAACAAGTTTATCAAAAATTTGAAAAGGCGCTTATAGATAAACTGAATTATCTGGTTTTTAAAGAAAAGGAACTTGGTGAGGTAGGGCACTACTTGACAGCTTATCAACTGACGGCCCCGTTCGAATCATTCAAAAAACTGGGCAAACAGTCTGGCATTCTGTTTTACGTGCCGGCAGATTATACTTCAAAAATCGATCCAACAACTGGCTTTGTGAACTTCCTGGACCTGAGATATCAGTCTGTAGAAAAAGCTAAACAACTTCTTAGCGATTTTAATGCCATTCGTTTTAACAGCGTTCAGAATTACTTTGAATTCGAAATTGACTATAAAAAACTTACTCCGAAACGTAAAGTCGGAACCCAAAGTAAATGGGTAATTTGTAC
GTATGGCGATGTCAGGTATCAGAACCGTCGGAATCAAAAAGGTCATTGGGAGACCGAAGAAGTGAACGTGACCGAAAAGCTGAAGGCTCTGTTCGCCAGCGATTCAAAAACTACAACTGTGATCGATTACGCAAATGATGATAACCTGATAGATGTGATTTTAGAGCAGGATAAAGCCAGCTTTTTTAAAGAACTGTTGTGGCTCCTGAAACTTACGATGACCTTACGACATTCCAAGATCAAATCGGAAGATGATTTTATTCTGTCACCGGTCAAGAATGAGCAGGGTGAATTCTATGATAGTAGGAAAGCCGGCGAAGTGTGGCCGAAAGACGCCGACGCCAATGGCGCCTATCATATCGCGCTCAAAGGGCTTTGGAATTTGCAGCAGATTAACCAGTGGGAAAAAGGTAAAACCCTGAATCTGGCTATCAAAAACCAGGATTGGTTTAGCTTTATCCAAGAGAAACCGTATCAGGAATGAGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATTTATTATATCGCGTTGATTATTGATGCTGTTTTTAGTTTTAACGGCAATTAATATATGTGTTATTAATTGAATGAATTTTATCATTCATAATAAGTATGTGTAGGATCAAGCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACAGAATTATCTCATAACAAGTGTTAAGGGATGTTATTTCC SEQAATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGTTGCCGTCACTGCGTCIDTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGCTTATTAAAAGCATTCTGTAACAAAGCGGGANO:CCAAAGCCATGACAAAAACGCGTAACAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGATTATT 65TGCACGGCGTCACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTGACGCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAGTAATACGACTCACTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAACTTTAAGAGGAGGATATACCATGCACCATCATCATCACCATCATACAGGCGGTCTTCTTAGTATGGACGCGAAAGAGTTCACAGGTCAGTATCCGTTGTCGAAAACATTACGATTCGAACTTCGGCCCATCGGCCGCACGTGGGATAACCTGGAGGCCTCAGGCTACTTAGCGGAAGACCGCCATCGTGCCGAATGTTATCCTCGTGCGAAAGAGTTATTGGATGACAACCATCGTGCCTTCCTGAATCGTGTGTTGCCACAAATCGATATGGATTGGCACCCGATTGCGGAGGCCTTTTGTAAGGTACATAAAAACCCTGGTAATAAAGAACTTGCCCAGGATTACAACCTTCAGTTGTCAAAGCGCCGTAAGGAGATCAGCGCATATCTTCAGGATGCAGATGGCTATAAAGGCCTGTTCGCGAAGCCCGCCTTAGACGAAGCTATGAAAATTGCGAAAGAAAACGGGAACGAAAGTGATATTGAGGTTCTCGAAGCGTTTAACGGTTTTAGCGTATACTTCACCGGTTATCATGAGTCACGCGAGAACATTTATAGCGATGAGGATATGGTGAGCGTAGCCTACCGAATTACTGAGGATAATTTCCCGCGCTTTGTCTCAAACGCTTTGATCTTTGATAAATTAAACGAAAGCCATCCGGATATTATCTCTGAAGTATCGGGCAATCTTGGAGTTGATGACATTGGTAAGTACTTTGACGTGTCGAACTATAACAATTTTCTTTCCCAGGCCGGTATAGATGACTACAATCACATTATTGGCGGCCATACAACCGAAGACGGACTGATACAAGCGTTTAATGTCGTATTGAACTTACGTCACCAAAAAGACCCTGGCTTTGAAAAAATT
CAGTTCAAACAGCTCTACAAACAAATCCTGAGCGTGCGTACCAGCAAAAGCTACATCCCGAAACAGTTTGACAACTCTAAGGAGATGGTTGACTGCATTTGCGATTATGTCAGCAAAATAGAGAAATCCGAAACAGTAGAACGGGCCCTGAAACTAGTCCGTAATATCAGTTCTTTCGACTTGCGCGGGATCTTTGTCAATAAAAAGAACTTGCGCATACTGAGCAACAAACTGATAGGAGATTGGGACGCGATCGAAACCGCATTGATGCATAGTTCTTCATCAGAAAACGATAAGAAAAGCGTATATGATAGCGCGGAGGCTTTTACGTTGGATGACATCTTTTCAAGCGTGAAAAAATTTTCTGATGCCTCTGCCGAAGATATTGGCAACAGGGCGGAAGACATCTGTAGAGTGATAAGTGAGACGGCCCCTTTTATCAACGATCTGCGAGCGGTGGACCTGGATAGCCTGAACGACGATGGTTATGAAGCGGCCGTCTCAAAAATTCGGGAGTCGCTGGAGCCTTATATGGATCTTTTCCATGAACTGGAAATTTTCTCGGTTGGCGATGAGTTCCCAAAATGCGCAGCATTTTACAGCGAACTGGAGGAAGTCAGCGAACAGCTGATCGAAATTATTCCGTTATTCAACAAGGCGCGTTCGTTCTGCACCCGGAAACGCTATAGCACCGATAAGATTAAAGTGAACTTAAAATTCCCGACCTTGGCGGACGGGTGGGACCTGAACAAAGAGAGAGACAACAAAGCCGCGATTCTGCGGAAAGACGGTAAGTATTATCTGGCAATTCTGGATATGAAGAAAGATCTGTCAAGCATTAGGACCAGCGACGAAGATGAATCCAGCTTCGAAAAGATGGAGTATAAACTGTTACCGAGTCCAGTAAAAATGCTGCCAAAGATATTCGTAAAATCGAAAGCCGCTAAGGAAAAATATGGCCTGACAGATCGTATGCTTGAATGCTACGATAAAGGTATGCATAAGTCGGGTAGTGCGTTTGATCTTGGCTTTTGCCATGAACTCATTGATTATTACAAGCGTTGTATCGCGGAGTACCCAGGCTGGGATGTGTTCGATTTCAAGTTTCGCGAAACTTCCGATTATGGGTCCATGAAAGAGTTCAATGAAGATGTGGCCGGAGCCGGTTACTATATGAGTCTGAGAAAAATTCCGTGCAGCGAAGTGTACCGTCTGTTAGACGAGAAATCGATTTATCTATTTCAAATTTATAACAAAGATTACTCTGAAAATGCACATGGTAATAAGAACATGCATACCATGTACTGGGAGGGTCTCTTTTCCCCGCAAAACCTGGAGTCGCCCGTTTTCAAGTTGTCGGGTGGGGCAGAACTTTTCTTTCGAAAATCCTCAATCCCTAACGATGCCAAAACAGTACACCCGAAAGGCTCAGTGCTGGTTCCACGTAATGATGTTAACGGTCGGCGTATTCCAGATTCAATCTACCGCGAACTGACACGCTATTTTAACCGTGGCGATTGCCGAATCAGTGACGAAGCCAAAAGTTATCTTGACAAGGTTAAGACTAAAAAAGCGGACCATGACATTGTGAAAGATCGCCGCTTTACCGTGGATAAAATGATGTTCCACGTCCCGATTGCGATGAACTTTAAGGCGATCAGTAAACCGAACTTAAACAAAAAAGTCATTGATGGCATCATTGATGATCAGGATCTGAAAATCATTGGTATTGATCGTGGCGAGCGGAACTTAATTTACGTCACGATGGTTGACAGAAAAGGGAATATCTTATATCAGGATTCTCTTAACATCCTCAATGGCTACGACTATCGTAAAGCTCTGGATGTGCGCGAATATGACAACAAGGAAGCGCGTCGTAACTGGACTAAAGTGGAGGGCATTCGCAAAATGAAGGAAGGCTATCTGTCATTAGCGGTCTCGAAATTAGCGGATATGATTATCGAAAATAACGCCATCATCGTTATGGAGGACCTGAACCA
CGGATTCAAAGCGGGCCGCTCAAAGATTGAAAAACAAGTTTATCAGAAATTTGAGAGTATGCTGATTAACAAACTGGGCTATATGGTGTTAAAAGACAAGTCAATTGACCAATCAGGTGGCGCGCTGCATGGATACCAGCTGGCGAACCATGTTACCACCTTAGCATCAGTTGGAAAGCAGTGTGGGGTTATCTTTTATATACCGGCAGCGTTCACTAGTAAAATAGATCCGACCACTGGTTTCGCCGATCTCTTTGCCCTGAGTAACGTTAAAAACGTAGCGAGCATGCGTGAATTCTTTTCCAAAATGAAATCTGTCATTTATGATAAAGCTGAAGGCAAATTCGCATTCACCTTTGATTACTTGGATTACAACGTGAAGAGCGAATGTGGTCGTACGCTGTGGACCGTTTACACCGTTGGTGAGCGCTTCACCTATTCCCGTGTGAACCGCGAATATGTACGTAAAGTCCCCACCGATATTATCTATGATGCCCTCCAGAAAGCAGGCATTAGCGTCGAAGGAGACTTAAGGGACAGAATTGCCGAAAGCGATGGCGATACGCTGAAGTCTATTTTTTACGCATTCAAATACGCGCTAGATATGCGCGTTGAGAATCGCGAGGAAGACTACATTCAATCACCTGTGAAAAATGCCTCTGGGGAATTTTTTTGTTCAAAAAATGCTGGTAAAAGCCTCCCACAAGATAGCGATGCAAACGGTGCATATAACATTGCCCTGAAAGGTATTCTTCAATTACGCATGCTGTCTGAGCAGTACGACCCCAACGCGGAATCTATTAGACTTCCGCTGATAACCAATAAAGCCTGGCTGACATTCATGCAGTCTGGCATGAAGACCTGGAAAAATTAGGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATGTAGGGAGACCCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACA SEQAAAATTCcatGCAAAATGCTCCGGTTTCATGTCATCAAAATGATGACGTAATTAAGCATTGATAATTGAGAIDTCCCTCTCCCTGACAGGATGATTACATAAATAATAGTGACAAAAATAAATTATTTATTTATCCAGAAAATNO:GAATTGGAAAATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTcaaaCAGGTtgccgtcactgc66gtcttttactggctcttctcgctaaccaaaccggtaaccccgcttattaaaagcattctgtaacaaagcgggaccaaagccatgacaaaaacgcgtaacaaaagtgtctataatcacggcagaaaagtccacattgattatttgcacggcgtcacactttgctatgccatagcatttttatccataagattagcggatcctacctgacgctttttatcgcaactctctactgtttctccatacccgtttttttgggctagcaccgcctatctcgtgtgagataggcggagatacgaactttaagAAGGAGatataccatgGATAGTTTGAAAGATTTCACCAATCTGTACCCTGTCAGTAAGACATTGAGATTTGAATTAAAGCCCGTTGGAAAGACTTTAGAAAATATCGAGAAAGCAGGTATTTTGAAAGAGGATGAGCATCGTGCAGAAAGTTATCGGAGGGTGAAGAAAATAATTGATACTTATCATAAGGTATTTATCGATTCTTCTCTTGAAAATATGGCTAAAATGGGTATTGAGAATGAAATAAAAGCAATGCTCCAAAGTTTCTGCGAATTGTATAAAAAAGATCATCGCACTGAGGGTGAAGACAAGGCATTAGATAAAATTCGAGCAGTACTTCGTGGCCTGATTGTTGGGGCTTTCACTGGTGTTTGCGGAAGACGGGAAAATACAGTCCAAAACGAGAAGTACGAGAGTTTGTTCAAAGAAAAGTTGATAAAAGAAATTTTACCTGATTTTGTGCTCTCTACTGAGGCTGAAAGC
TTGCCTTTCTCTGTTGAAGAAGCTACGAGGTCACTGAAGGAGTTTGATAGCTTTACATCCTACTTTGCTGGTTTTTACGAGAATAGAAAGAATATATACTCGACGAAACCTCAATCCACTGCCATTGCTTATCGTCTTATTCATGAGAACTTGCCGAAGTTCATTGATAATATTCTTGTTTTTCAGAAGATCAAAGAGCCTATAGCCAAAGAGCTGGAACATATTCGTGCGGACTTTTCTGCCGGGGGGTACATAAAAAAGGATGAGAGATTGGAGGATATTTTTTCGTTGAACTATTATATCCACGTGTTATCTCAGGCTGGGATCGAAAAATATAACGCATTGATTGGGAAGATTGTGACAGAAGGAGATGGAGAGATGAAAGGGCTCAATGAACACATCAACCTTTACAACCAACAAAGAGGCAGAGAGGATCGGCTCCCTCTTTTTAGGCCTCTTTATAAACAGATATTGAGTGACAGAGAGCAATTATCATACTTGCCTGAGAGTTTTGAAAAAGATGAGGAGCTCCTCAGGGCTCTAAAAGAGTTCTATGATCATATCGCAGAAGACATTCTCGGACGTACTCAACAGTTGATGACTTCTATTTCAGAATATGATTTATCTCGGATATACGTAAGGAACGATAGCCAATTGACTGATATATCAAAAAAAATGTTGGGAGATTGGAATGCTATCTACATGGCTAGAGAACGAGCATATGACCACGAGCAGGCTCCCAAAAGAATCACGGCGAAATACGAGAGGGACAGGATTAAAGCTCTTAAAGGAGAAGAGAGTATAAGTCTGGCAAATCTTAATAGTTGTATTGCCTTTCTGGACAATGTTAGAGATTGCCGTGTAGATACTTATCTTTCCACACTGGGCCAGAAGGAAGGACCACATGGTCTATCTAATCTCGTTGAGAACGTTTTTGCCTCATACCATGAAGCAGAGCAATTGTTGAGCTTTCCATACCCCGAAGAGAATAATCTGATTCAGGACAAGGACAATGTGGTGTTAATTAAGAATCTTCTCGACAATATCAGTGATCTGCAGAGGTTCTTGAAACCTCTTTGGGGTATGGGAGACGAACCCGATAAAGATGAAAGATTTTATGGAGAGTATAATTATATCCGAGGAGCTCTAGATCAGGTGATCCCTCTGTACAATAAGGTAAGGAACTACCTCACTCGGAAGCCTTATTCGACCAGAAAAGTAAAACTCAATTTTGGGAATTCTCAATTGCTTAGTGGTTGGGATAGAAATAAGGAAAAGGATAATAGCTGTGTGATTTTGCGTAAGGGGCAGAACTTCTATTTGGCTATTATGAACAATAGGCACAAAAGAAGTTTCGAAAACAAGGTGTTGCCCGAGTATAAGGAGGGAGAACCTTACTTCGAAAAGATGGATTATAAATTTTTGCCTGATCCTAATAAAATGCTTCCTAAGGTTTTTCTTTCGAAAAAAGGAATAGAGATATACAAACCAAGTCCGAAGCTTTTAGAACAATATGGACATGGAACTCACAAAAAGGGAGATACCTTTAGTATGGATGATTTGCACGAACTGATCGATTTCTTCAAACACTCAATCGAGGCTCATGAAGATTGGAAGCAATTCGGATTCAAATTTTCTGATACGGCTACTTATGAGAATGTATCTAGTTTCTATAGAGAAGTTGAGGATCAGGGGTATAAGCTCTCTTTCCGAAAAGTTTCGGAATCTTATGTCTATTCATTAATAGATCAAGGCAAGTTGTATTTATTTCAGATATACAACAAGGACTTTTCTCCCTGCAGCAAAGGGACACCTAATCTGCATACCTTGTATTGGAGAATGCTTTTTGACGAGCGCAATTTGGCAGATGTCATATACAAACTGGATGGGAAGGCTGAAATCTTTTTCCGAGAGAAGAGTTTGAAAAATGATCATCCCACGCATCCGGCTGGTAAGCCTATCAAAAAGAAAAGTCGACAAAAAAAAGGAGAGGAGAGTCT
GTTTGAGTATGATTTAGTCAAGGATAGGCACTATACGATGGATAAGTTCCAGTTTCATGTGCCTATTACTATGAATTTTAAATGTTCTGCAGGAAGCAAAGTCAATGATATGGTTAATGCTCATATTCGAGAGGCAAAGGATATGCATGTCATTGGAATTGATCGTGGAGAACGCAATCTGCTGTATATATGCGTGATAGATAGTCGAGGGACGATTTTGGATCAAATTTCTCTGAATACGATTAACGATATAGACTATCATGATTTATTGGAGAGTCGAGACAAAGACCGTCAGCAGGAGCGCCGAAACTGGCAAACTATCGAAGGGATCAAGGAGCTAAAACAAGGCTACCTTAGTCAGGCGGTTCATCGGATAGCCGAACTGATGGTGGCTTATAAGGCTGTAGTTGCTTTGGAGGATTTGAATATGGGGTTCAAACGTGGGCGGCAGAAAGTAGAAAGTTCTGTTTATCAGCAGTTTGAGAAACAGCTGATAGATAAGCTCAACTATCTTGTGGACAAGAAGAAAAGGCCTGAAGATATTGGAGGATTGTTGAGAGCCTATCAATTTACGGCCCCATTTAAGAGTTTTAAGGAAATGGGAAAGCAAAACGGCTTCTTGTTTTATATCCCGGCTTGGAACACGAGCAACATAGATCCGACTACTGGATTTGTTAATTTATTTCATGCCCAGTATGAAAATGTAGATAAAGCGAAGAGCTTCTTTCAAAAGTTTGATTCAATTAGTTACAACCCGAAGAAAGACTGGTTTGAGTTTGCATTCGATTATAAAAACTTTACTAAAAAGGCTGAAGGAAGTCGTTCTATGTGGATATTATGCACACATGGTTCCCGAATAAAGAATTTTAGAAATTCCCAGAAGAATGGTCAATGGGATTCCGAAGAATTCGCCTTGACGGAGGCTTTTAAGTCTCTTTTTGTGCGATATGAGATAGATTATACCGCTGATTTGAAAACAGCTATTGTGGACGAAAAGCAAAAAGACTTCTTCGTGGATCTTCTGAAGCTATTCAAATTGACAGTACAGATGCGCAACAGCTGGAAAGAGAAGGATTTGGATTATCTAATCTCTCCTGTAGCAGGGGCTGATGGCCGTTTCTTCGATACAAGAGAGGGAAATAAAAGTCTGCCTAAGGATGCAGATGCCAATGGAGCTTATAATATTGCCCTAAAAGGACTTTGGGCTCTACGCCAGATTCGGCAAACTTCAGAAGGCGGTAAACTCAAATTGGCGATTTCCAATAAGGAATGGCTACAGTTTGTGCAAGAGAGATCTTACGAGAAAGACtgaGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATTTATTATATCGCGTTGATTATTGATGCTGTTTTTAGTTTTAACGGCAATTAATATATGTGTTATTAATTGAATGAATTTTATCATTCATAATAAGTATGTGTAGGATCAAGCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACAGAATTATCTCATAACAAGTGTTAAGGGATGTTATTTCC 
SEQAAAATTCcatGCAAAATGCTCCGGTTTCATGTCATCAAAATGATGACGTAATTAAGCATTGATAATTGAGAIDTCCCTCTCCCTGACAGGATGATTACATAAATAATAGTGACAAAAATAAATTATTTATTTATCCAGAAAATNO:GAATTGGAAAATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTcaaaCAGGTtgccgtcactgc67gtcttttactggctcttctcgctaaccaaaccggtaaccccgcttattaaaagcattctgtaacaaagcgggaccaaagccatgacaaaaacgcgtaacaaaagtgtctataatcacggcagaaaagtccacattgattatttgcacggcgtcacactttgctatgccatagcatttttatccataagattagcggatcctacctgacgctttttatcgcaactctctactgtttctccatacccgtttttttgggctagcaccgcctatctcgtgtgagataggcggagatacgaactttaagAAGGAGatataccATGAACAACGGCACAAATAATTTTCAGAACTTCATCGGGATCTCAAGTTTGCAGAAAACGCTGCGCAATGCTCTGATCCCCACGGAAACCACGCAACAGTTCATCGTCAAGAACGGAATAATTAAAGAAGATGAGTTACGTGGCGAGAACCGCCAGATTCTGAAAGATATCATGGATGACTACTACCGCGGATTCATCTCTGAGACTCTGAGTTCTATTGATGACATAGATTGGACTAGCCTGTTCGAAAAAATGGAAATTCAGCTGAAAAATGGTGATAATAAAGATACCTTAATTAAGGAACAGACAGAGTATCGGAAAGCAATCCATAAAAAATTTGCGAACGACGATCGGTTTAAGAACATGTTTAGCGCCAAACTGATTAGTGACATATTACCTGAATTTGTCATCCACAACAATAATTATTCGGCATCAGAGAAAGAGGAAAAAACCCAGGTGATAAAATTGTTTTCGCGCTTTGCGACTAGCTTTAAAGATTACTTCAAGAACCGTGCAAATTGCTTTTCAGCGGACGATATTTCATCAAGCAGCTGCCATCGCATCGTCAACGACAATGCAGAGATATTCTTTTCAAATGCGCTGGTCTACCGCCGGATCGTAAAATCGCTGAGCAATGACGATATCAACAAAATTTCGGGCGATATGAAAGATTCATTAAAAGAAATGAGTCTGGAAGAAATATATTCTTACGAGAAGTATGGGGAATTTATTACCCAGGAAGGCATTAGCTTCTATAATGATATCTGTGGGAAAGTGAATTCTTTTATGAACCTGTATTGTCAGAAAAATAAAGAAAACAAAAATTTATACAAACTTCAGAAACTTCACAAACAGATTCTATGCATTGCGGACACTAGCTATGAGGTCCCGTATAAATTTGAAAGTGACGAGGAAGTGTACCAATCAGTTAACGGCTTCCTTGATAACATTAGCAGCAAACATATAGTCGAAAGATTACGCAAAATCGGCGATAACTATAACGGCTACAACCTGGATAAAATTTATATCGTGTCCAAATTTTACGAGAGCGTTAGCCAAAAAACCTACCGCGACTGGGAAACAATTAATACCGCCCTCGAAATTCATTACAATAATATCTTGCCGGGTAACGGTAAAAGTAAAGCCGACAAAGTAAAAAAAGCGGTTAAGAATGATTTACAGAAATCCATCACCGAAATAAATGAACTAGTGTCAAACTATAAGCTGTGCAGTGACGACAACATCAAAGCGGAGACTTATATACATGAGATTAGCCATATCTTGAATAACTTTGAAGCACAGGAATTGAAATACAATCCGGAAATTCACCTAGTTGAATCCGAGCTCAAAGCGAGTGAGCTTAAAAACGTGCTGGACGTGATCATGAATGCGTTTCATTGGTGTTCGGTTTTTATGACTGAGGAACTTGTTGATAAAGACAACAATTTTTATGCGG
AACTGGAGGAGATTTACGATGAAATTTATCCAGTAATTAGTCTGTACAACCTGGTTCGTAACTACGTTACCCAGAAACCGTACAGCACGAAAAAGATTAAATTGAACTTTGGAATACCGACGTTAGCAGACGGTTGGTCAAAGTCCAAAGAGTATTCTAATAACGCTATCATACTGATGCGCGACAATCTGTATTATCTGGGCATCTTTAATGCGAAGAATAAACCGGACAAGAAGATTATCGAGGGTAATACGTCAGAAAATAAGGGTGACTACAAAAAGATGATTTATAATTTGCTCCCGGGTCCCAACAAAATGATCCCGAAAGTTTTCTTGAGCAGCAAGACGGGGGTGGAAACGTATAAACCGAGCGCCTATATCCTAGAGGGGTATAAACAGAATAAACATATCAAGTCTTCAAAAGACTTTGATATCACTTTCTGTCATGATCTGATCGACTACTTCAAAAACTGTATTGCAATTCATCCCGAGTGGAAAAACTTCGGTTTTGATTTTAGCGACACCAGTACTTATGAAGACATTTCCGGGTTTTATCGTGAGGTAGAGTTACAAGGTTACAAGATTGATTGGACATACATTAGCGAAAAAGACATTGATCTGCTGCAGGAAAAAGGTCAACTGTATCTGTTCCAGATATATAACAAAGATTTTTCGAAAAAATCAACCGGGAATGACAACCTTCACACCATGTACCTGAAAAATCTTTTCTCAGAAGAAAATCTTAAGGATATCGTCCTGAAACTTAACGGCGAAGCGGAAATCTTCTTCAGGAAGAGCAGCATAAAGAACCCAATCATTCATAAAAAAGGCTCGATTTTAGTCAACCGTACCTACGAAGCAGAAGAAAAAGACCAGTTTGGCAACATTCAAATTGTGCGTAAAAATATTCCGGAAAACATTTATCAGGAGCTGTACAAATACTTCAACGATAAAAGCGACAAAGAGCTGTCTGATGAAGCAGCCAAACTGAAGAATGTAGTGGGACACCACGAGGCAGCGACGAATATAGTCAAGGACTATCGCTACACGTATGATAAATACTTCCTTCATATGCCTATTACGATCAATTTCAAAGCCAATAAAACGGGTTTTATTAATGATAGGATCTTACAGTATATCGCTAAAGAAAAAGACTTACATGTGATCGGCATTGATCGGGGCGAGCGTAACCTGATCTACGTGTCCGTGATTGATACTTGTGGTAATATAGTTGAACAGAAAAGCTTTAACATTGTAAACGGCTACGACTATCAGATAAAACTGAAACAACAGGAGGGCGCTAGACAGATTGCGCGGAAAGAATGGAAAGAAATTGGTAAAATTAAAGAGATCAAAGAGGGCTACCTGAGCTTAGTAATCCACGAGATCTCTAAAATGGTAATCAAATACAATGCAATTATAGCGATGGAGGATTTGTCTTATGGTTTTAAAAAAGGGCGCTTTAAGGTCGAACGGCAAGTTTACCAGAAATTTGAAACCATGCTCATCAATAAACTCAACTATCTGGTATTTAAAGATATTTCGATTACCGAGAATGGCGGTCTCCTGAAAGGTTATCAGCTGACATACATTCCTGATAAACTTAAAAACGTGGGTCATCAGTGCGGCTGCATTTTTTATGTGCCTGCTGCATACACGAGCAAAATTGATCCGACCACCGGCTTTGTGAATATCTTTAAATTTAAAGACCTGACAGTGGACGCAAAACGTGAATTCATTAAAAAATTTGACTCAATTCGTTATGACAGTGAAAAAAATCTGTTCTGCTTTACATTTGACTACAATAACTTTATTACGCAAAACACGGTCATGAGCAAATCATCGTGGAGTGTGTATACATACGGCGTGCGCATCAAACGTCGCTTTGTGAACGGCCGCTTCTCAAACGAAAGTGATACCATTGACATAACCAAAGATATGGAGAAAACGTTGGAAATGACGGACATTAACTGGCGCGATGGCCACGATCTTCGTCAAGAC
ATTATAGATTATGAAATTGTTCAGCACATATTCGAAATTTTCCGTTTAACAGTGCAAATGCGTAACTCCTTGTCTGAACTGGAGGACCGTGATTACGATCGTCTCATTTCACCTGTACTGAACGAAAATAACATTTTTTATGACAGCGCGAAAGCGGGGGATGCACTTCCTAAGGATGCCGATGCAAATGGTGCGTATTGTATTGCATTAAAAGGGTTATATGAAATTAAACAAATTACCGAAAATTGGAAAGAAGATGGTAAATTTTCGCGCGATAAACTCAAAATCAGCAATAAAGATTGGTTCGACTTTATCCAGAATAAGCGCTATCTCTAAGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATTTATTATATCGCGTTGATTATTGATGCTGTTTTTAGTTTTAACGGCAATTAATATATGTGTTATTAATTGAATGAATTTTATCATTCATAATAAGTATGTGTAGGATCAAGCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACAGAATTATCTCATAACAAGTGTTAAGGGATGTTATTTCC SEQAATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGTTGCCGTCACTGCGTCIDTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGCTTATTAAAAGCATTCTGTAACAAAGCGGGANO:CCAAAGCCATGACAAAAACGCGTAACAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGATTATT 68TGCACGGCGTCACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTGACGCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAGTAATACGACTCACTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAACTTTAAGAGGAGGATATACCATGCACCATCATCATCACCATACCAATAAATTCACTAACCAGTATTCTCTCTCTAAGACCCTGCGCTTTGAACTGATTCCGCAGGGGAAAACCTTGGAGTTCATTCAAGAAAAAGGCCTCTTGTCTCAGGATAAACAGAGGGCTGAATCTTACCAAGAAATGAAGAAAACTATTGATAAGTTTCATAAATATTTCATTGATTTAGCCTTGTCTAACGCCAAATTAACTCACTTGGAAACGTATCTGGAGTTATACAACAAATCTGCCGAAACTAAGAAAGAACAGAAATTTAAAGACGATTTGAAAAAAGTACAGGACAATCTGCGTAAAGAAATTGTCAAATCCTTCAGTGACGGCGATGCTAAAAGCATTTTTGCCATTCTGGACAAAAAAGAGTTGATTACTGTGGAATTAGAAAAGTGGTTTGAAAACAATGAGCAGAAAGACATCTACTTCGATGAGAAATTCAAAACTTTCACCACCTATTTTACAGGATTTCATCAAAACCGGAAGAACATGTACTCAGTAGAACCGAACTCCACGGCCATTGCGTATCGTTTGATCCATGAGAATCTGCCTAAATTTCTGGAGAATGCGAAAGCCTTTGAAAAGATTAAGCAGGTCGAATCGCTGCAAGTGAATTTTCGTGAACTCATGGGCGAATTTGGTGACGAAGGTCTAATCTTCGTTAACGAACTGGAAGAAATGTTTCAGATTAATTACTACAATGACGTGCTATCGCAGAACGGTATCACAATCTACAATAGTATTATCTCAGGGTTCACAAAAAACGATATAAAATACAAAGGCCTGAACGAGTATATCAATAACTACAACCAAACAAAGGACAAAAAGGATAGGCTTCCGAAACTGAAGCAGTTATACAAACAGATTTTATCTGACAGAATCTCCCTGAGCTTTCTGCCGGATGCTTTCACTGATGGGAAGCAGGTTCTGAAAGCGATTTTCGATTTTTATAAGATTAACTTACTGAGCTACA
CGATTGAAGGTCAAGAAGAATCTCAAAACTTACTGCTCTTGATCCGTCAAACCATTGAAAATCTATCATCGTTCGATACGCAGAAAATCTACCTCAAAAACGATACTCACCTGACTACGATCTCTCAGCAGGTTTTCGGGGATTTTAGTGTATTTTCAACAGCTCTGAACTACTGGTATGAAACCAAAGTCAATCCGAAATTCGAGACGGAATATTCTAAGGCCAACGAAAAAAAACGTGAGATTCTTGATAAAGCTAAAGCCGTATTTACTAAACAGGATTACTTTTCTATTGCTTTCCTGCAGGAAGTTTTATCGGAGTATATCCTGACCCTGGATCATACATCTGATATCGTTAAAAAACACAGCAGCAATTGCATCGCTGACTATTTCAAAAACCACTTTGTCGCCAAAAAAGAAAACGAAACAGACAAGACTTTCGATTTCATTGCTAACATCACCGCAAAATACCAGTGTATTCAGGGTATCTTGGAAAACGCCGACCAATACGAAGACGAACTGAAACAAGATCAGAAGCTGATCGATAATTTAAAATTCTTCTTAGATGCAATCCTGGAGCTGCTGCACTTCATCAAACCGCTTCATTTAAAGAGCGAGTCCATTACCGAAAAGGACACCGCCTTCTATGACGTTTTTGAAAATTATTATGAAGCCCTCTCCTTGCTGACTCCGCTGTATAATATGGTACGCAATTACGTAACCCAGAAACCATATTCTACCGAAAAAATTAAACTGAACTTTGAAAACGCACAGCTGCTCAACGGTTGGGACGCGAATAAAGAAGGTGACTACCTCACCACCATCCTGAAAAAAGATGGTAACTATTTTCTGGCAATTATGGATAAGAAACATAATAAAGCATTCCAGAAATTTCCTGAAGGGAAAGAAAATTACGAAAAGATGGTGTACAAACTCTTACCTGGAGTTAACAAAATGTTGCCGAAAGTATTTTTTAGTAATAAGAACATCGCGTACTTTAACCCGTCCAAAGAACTGCTGGAAAATTATAAAAAGGAGACGCATAAGAAAGGGGATACCTTTAACCTGGAACATTGCCATACCTTAATAGACTTCTTCAAGGATTCCCTGAATAAACACGAGGATTGGAAATATTTCGATTTTCAGTTTAGTGAGACCAAGTCATACCAGGATCTTAGCGGCTTTTATCGCGAAGTAGAACACCAAGGCTATAAAATTAACTTCAAAAACATCGACAGCGAATACATCGACGGTTTAGTTAACGAGGGCAAACTGTTTCTGTTCCAGATCTATTCAAAGGATTTTAGCCCGTTCTCTAAAGGCAAACCAAATATGCATACGTTGTACTGGAAAGCACTGTTTGAAGAGCAAAACCTGCAGAATGTGATTTATAAACTGAACGGCCAAGCTGAGATTTTTTTCCGTAAAGCCTCGATTAAACCGAAAAATATCATCCTTCATAAGAAGAAAATAAAGATCGCTAAAAAACACTTCATAGATAAAAAAACCAAAACCTCCGAAATAGTGCCTGTTCAAACAATTAAGAACTTGAATATGTACTACCAGGGCAAGATATCGGAAAAGGAGTTGACTCAAGACGATCTTCGCTATATCGATAACTTTTCGATTTTTAACGAAAAAAACAAGACGATCGACATCATCAAAGATAAACGCTTCACTGTAGATAAGTTCCAGTTTCATGTGCCGATTACTATGAACTTCAAAGCTACCGGGGGTAGCTATATCAACCAAACGGTGTTGGAATACCTGCAGAATAACCCGGAAGTCAAAATCATTGGGCTGGACCGCGGAGAACGTCACCTTGTGTACTTGACCTTAATCGATCAGCAAGGCAACATCTTAAAACAAGAATCGCTGAATACCATTACGGATTCAAAGATTAGCACCCCGTATCATAAGCTGCTCGATAACAAGGAGAATGAGCGCGACCTGGCCCGTAAAAACTGGGGCACGGTGGAAAACATTAAGGAGTTAAAGGAG
GGTTATATTTCCCAGGTAGTGCATAAGATCGCCACTCTCATGCTCGAGGAAAATGCGATCGTTGTCATGGAAGACTTAAACTTCGGATTTAAACGTGGGCGATTTAAAGTAGAGAAACAAATCTACCAGAAGTTAGAAAAAATGCTGATTGACAAATTAAATTACTTGGTCCTAAAAGACAAACAGCCGCAAGAATTGGGTGGATTATACAACGCCCTCCAACTTACCAATAAATTCGAAAGTTTTCAGAAAATGGGTAAACAGTCAGGCTTTCTTTTTTATGTTCCTGCGTGGAACACATCCAAAATCGACCCTACAACCGGCTTCGTCAATTACTTCTATACTAAATATGAAAACGTCGACAAAGCAAAAGCATTCTTTGAAAAGTTCGAAGCAATACGTTTTAACGCTGAGAAAAAATATTTCGAGTTCGAAGTCAAGAAATACTCAGACTTTAACCCCAAAGCTGAGGGCACACAGCAAGCGTGGACAATCTGCACCTACGGCGAGCGCATCGAAACGAAGCGTCAAAAAGATCAGAATAACAAATTTGTTTCAACACCTATCAACCTGACCGAGAAGATTGAAGACTTCTTAGGTAAAAATCAGATTGTTTATGGCGACGGTAACTGTATAAAATCTCAAATAGCCTCAAAGGATGATAAAGCATTTTTCGAAACATTATTATATTGGTTCAAAATGACACTGCAGATGCGCAATAGTGAGACGCGTACAGATATTGATTATCTTATCAGCCCGGTCATGAACGACAACGGTACTTTTTACAACTCCAGAGACTATGAAAAACTTGAGAATCCAACTCTCCCCAAAGATGCTGATGCGAACGGTGCTTATCACATCGCGAAAAAAGGTCTGATGCTGCTGAACAAAATCGACCAAGCCGATCTGACTAAGAAAGTTGACCTAAGCATTTCAAATCGGGACTGGTTACAGTTTGTTCAAAAGAACAAATGAGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATGTAGGGAGACCCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACA 
SEQAAAATTCcatGCAAAATGCTCCGGTTTCATGTCATCAAAATGATGACGTAATTAAGCATTGATAATTGAGAIDTCCCTCTCCCTGACAGGATGATTACATAAATAATAGTGACAAAAATAAATTATTTATTTATCCAGAAAATNO:GAATTGGAAAATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTcaaaCAGGTtgccgtcactgc69gtcttttactggctcttctcgctaaccaaaccggtaaccccgcttattaaaagcattctgtaacaaagcgggaccaaagccatgacaaaaacgcgtaacaaaagtgtctataatcacggcagaaaagtccacattgattatttgcacggcgtcacactttgctatgccatagcatttttatccataagattagcggatcctacctgacgctttttatcgcaactctctactgtttctccatacccgtttttttgggctagcaccgcctatctcgtgtgagataggcggagatacgaactttaagAAGGAGatataccATGGAACAGGAATATTATCTGGGCTTGGACATGGGCACCGGTTCCGTCGGCTGGGCTGTTACTGACAGTGAATATCACGTTCTAAGAAAGCATGGTAAGGCATTGTGGGGTGTAAGACTTTTCGAATCTGCTTCCACTGCTGAAGAGCGTAGAATGTTTAGAACGAGTCGACGTAGGCTAGACAGGCGCAATTGGAGAATCGAAATTTTACAAGAAATTTTTGCGGAAGAGATATCTAAGAAAGACCCAGGCTTTTTCCTGAGAATGAAGGAATCTAAGTATTACCCTGAGGATAAAAGAGATATAAATGGTAACTGTCCCGAATTGCCTTACGCATTATTTGTGGACGATGATTTTACCGATAAGGATTACCATAAAAAGTTCCCAACTATCTACCATTTACGCAAAATGTTAATGAATACAGAGGAAACCCCAGACATAAGACTAGTTTATCTGGCAATACACCATATGATGAAACATAGAGGCCATTTCTTACTTTCCGGGGATATCAACGAAATCAAAGAGTTTGGTACCACATTTAGTAAGTTACTGGAAAACATAAAGAATGAAGAATTGGATTGGAACTTAGAACTCGGAAAAGAAGAATACGCGGTTGTCGAATCTATCCTGAAGGATAATATGCTGAATAGGTCGACCAAAAAAACTAGGCTGATCAAAGCACTGAAAGCCAAATCTATCTGCGAAAAAGCTGTTTTAAATTTACTTGCTGGTGGCACTGTTAAGTTATCAGACATTTTTGGTTTGGAAGAATTGAACGAAACCGAGCGTCCAAAAATTAGTTTCGCTGATAATGGCTACGATGATTACATTGGTGAGGTGGAAAACGAGTTGGGCGAACAATTTTATATTATAGAGACAGCTAAGGCAGTCTATGACTGGGCTGTTTTAGTAGAAATCCTTGGTAAATACACATCTATCTCCGAAGCGAAAGTTGCTACTTACGAAAAGCACAAGTCCGATCTCCAGTTTTTGAAGAAAATTGTCAGGAAATATCTGACTAAGGAAGAATATAAAGATATTTTCGTTAGTACCTCTGACAAACTGAAAAATTACTCCGCTTACATCGGGATGACCAAGATTAATGGCAAAAAAGTTGATCTGCAAAGCAAAAGGTGTTCGAAGGAAGAATTTTATGATTTCATTAAAAAGAATGTCTTAAAAAAATTAGAAGGTCAGCCAGAATACGAATATTTGAAAGAAGAACTGGAAAGAGAGACATTCTTACCAAAACAAGTCAACAGAGATAATGGGGTAATTCCATATCAAATTCACCTCTACGAATTAAAAAAAATTTTAGGCAATTTACGCGATAAAATTGACCTTATCAAAGAAAATGAGGATAAGCTGGTTCAACTCTTTGAATTCAGAATACCCTATTATGTGGGCCCACTGAACAAGATTGATGACGGCAAAGAAGGTAAATTCACATGGGCCG
TCCGCAAATCCAATGAAAAAATTTACCCATGGAACTTTGAAAATGTAGTAGATATTGAAGCGTCTGCGGAGAAATTTATTCGAAGAATGACTAATAAATGCACTTACTTGATGGGAGAGGATGTTCTGCCTAAAGACAGCTTATTATACAGCAAGTACATGGTTCTAAACGAACTTAACAACGTTAAGTTGGACGGTGAGAAATTAAGTGTAGAATTGAAACAAAGATTGTATACTGACGTCTTCTGCAAGTACAGAAAAGTGACAGTTAAAAAAATTAAGAATTACTTGAAGTGCGAAGGTATAATTTCTGGAAACGTAGAGATTACTGGTATTGATGGTGATTTCAAAGCATCCCTAACAGCTTACCACGATTTCAAGGAAATCCTGACAGGAACTGAACTCGCAAAAAAAGATAAAGAAAACATTATTACTAATATTGTTCTTTTCGGTGATGACAAGAAATTGTTGAAGAAAAGACTGAATAGACTTTACCCCCAGATTACTCCCAATCAACTTAAGAAAATTTGTGCTTTGTCTTACACAGGATGGGGTCGTTTTTCAAAAAAGTTCTTAGAAGAGATTACCGCACCTGATCCAGAAACAGGCGAAGTATGGAATATAATTACCGCCTTATGGGAATCGAACAATAATCTTATGCAACTTCTGAGCAATGAATATCGTTTCATGGAAGAAGTTGAGACTTACAACATGGGCAAACAGACGAAGACTTTATCCTATGAAACTGTGGAAAATATGTATGTATCACCTTCTGTCAAGAGACAAATTTGGCAAACCTTAAAAATTGTCAAAGAATTAGAAAAGGTAATGAAGGAGTCTCCTAAACGTGTGTTTATTGAAATGGCTAGAGAAAAACAAGAGTCAAAAAGAACCGAGTCAAGAAAGAAGCAGTTAATCGATTTATATAAGGCTTGTAAAAACGAAGAGAAAGATTGGGTTAAAGAATTGGGGGACCAAGAGGAACAAAAACTACGGTCGGATAAGTTGTATTTATACTATACGCAAAAGGGACGATGTATGTATTCCGGCGAGGTAATAGAATTGAAGGATTTATGGGACAATACAAAATATGACATAGACCATATATATCCCCAATCAAAAACGATGGACGATAGCTTGAACAATAGAGTACTCGTGAAAAAAAAATATAATGCGACCAAATCTGATAAGTATCCTCTGAATGAAAATATCAGACATGAAAGAAAGGGGTTCTGGAAGTCCTTGTTAGATGGTGGGTTTATAAGCAAAGAAAAGTACGAGCGTCTAATAAGAAACACGGAGTTATCGCCAGAAGAACTCGCTGGTTTTATTGAGAGGCAAATCGTGGAAACGAGACAATCTACCAAAGCCGTTGCTGAGATCCTAAAGCAAGTTTTCCCAGAGTCGGAGATTGTCTATGTCAAAGCTGGCACAGTGAGCAGGTTTAGGAAAGACTTCGAACTATTAAAGGTAAGAGAAGTGAACGATTTACATCACGCAAAGGACGCTTACCTAAATATCGTTGTAGGTAACTCATATTATGTTAAATTTACCAAGAACGCCTCTTGGTTTATAAAGGAGAACCCAGGTAGAACATATAACCTGAAAAAGATGTTCACCTCTGGTTGGAATATTGAGAGAAACGGAGAAGTCGCATGGGAAGTTGGTAAGAAAGGGACTATAGTGACAGTAAAGCAAATTATGAACAAAAATAATATCCTCGTTACAAGGCAGGTTCATGAAGCAAAGGGCGGCCTTTTTGACCAACAAATTATGAAGAAAGGGAAAGGTCAAATTGCAATAAAAGAAACCGATGAGAGACTAGCGTCAATAGAAAAGTATGGTGGCTATAATAAAGCTGCGGGTGCATACTTTATGCTTGTTGAATCAAAAGACAAGAAAGGTAAGACTATTAGAACTATAGAATTTATACCCCTGTACCTTAAAAACAAAATTGAATCGGATGAGTCAATCGCGTTAAATTTTCTAGAGAAAGGAAGG
GGTTTAAAAGAACCAAAGATCCTGTTAAAAAAGATTAAGATTGACACCTTGTTCGATGTAGATGGATTTAAAATGTGGTTATCTGGCAGAACAGGCGATAGACTTTTGTTTAAGTGCGCTAATCAATTAATTTTGGATGAGAAAATCATTGTCACAATGAAAAAAATAGTTAAGTTTATTCAGAGAAGACAAGAAAACAGGGAGTTGAAATTATCTGATAAAGATGGTATCGACAATGAAGTTTTAATGGAAATCTACAATACATTCGTTGATAAACTTGAAAATACCGTATATCGAATCAGGTTAAGTGAACAAGCCAAAACATTAATTGATAAACAAAAAGAATTTGAAAGGCTATCACTGGAAGACAAATCCTCCACCCTATTTGAAATTTTGCATATATTCCAGTGCCAATCTTCAGCAGCTAATTTAAAAATGATTGGCGGACCTGGGAAAGCCGGCATCCTAGTGATGAACAATAATATCTCCAAGTGTAACAAAATATCAATTATTAACCAATCTCCGACAGGTATTTTTGAAAATGAAATAGACTTGCTTAAGATATAAGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATTTATTATATCGCGTTGATTATTGATGCTGTTTTTAGTTTTAACGGCAATTAATATATGTGTTATTAATTGAATGAATTTTATCATTCATAATAAGTATGTGTAGGATCAAGCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACAGAATTATCTCATAACAAGTGTTAAGGGATGTTATTTCC SEQAATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGTTGCCGTCACTGCGTCIDTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGCTTATTAAAAGCATTCTGTAACAAAGCGGGANO:CCAAAGCCATGACAAAAACGCGTAACAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGATTATT 
70TGCACGGCGTCACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTGACGCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAGTAATACGACTCACTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAACTTTAAGAGGAGGATATACCATGCACCATCATCATCACCATTCTTTCGACTCTTTCACCAACCTGTACTCTCTGTCTAAAACCCTGAAATTCGAAATGCGTCCGGTTGGTAACACCCAGAAAATGCTGGACAACGCGGGTGTTTTCGAAAAAGACAAACTGATCCAGAAAAAATACGGTAAAACCAAACCGTACTTCGACCGTCTGCACCGTGAATTCATCGAAGAAGCGCTGACCGGTGTTGAACTGATCGGTCTGGACGAAAACTTCCGTACCCTGGTTGACTGGCAGAAAGACAAAAAAAACAACGTTGCGATGAAAGCGTACGAAAACTCTCTGCAGCGTCTGCGTACCGAAATCGGTAAAATCTTCAACCTGAAAGCGGAAGACTGGGTTAAAAACAAATACCCGATCCTGGGTCTGAAAAACAAAAACACCGACATCCTGTTCGAAGAAGCGGTTTTCGGTATCCTGAAAGCGCGTTACGGTGAAGAAAAAGACACCTTCATCGAAGTTGAAGAAATCGACAAAACCGGTAAATCTAAAATCAACCAGATCTCTATCTTCGACTCTTGGAAAGGTTTCACCGGTTACTTCAAAAAATTCTTCGAAACCCGTAAAAACTTCTACAAAAACGACGGTACCTCTACCGCGATCGCGACCCGTATCATCGACCAGAACCTGAAACGTTTCATCGACAACCTGTCTATCGTTGAATCTGTTCGTCAGAAAGTTGACCTGGCGGAAACCGAAAAATCTTTCTCTATCTCTCTGTCTCAGTTCTTCTCTATCGACTTCTACAACAAATGCCTGCTGCAGGACGGTATCGACTACTACAACAAAATCATCGGTGGTGAAACCCTGAAAAACGGTGAAAAACTGATCGGTCTGAACGAACTGATCAACCAGTACCGTCAGAACAACAAAGACCAGAAAATCCCGTTCTTCAAACTGCTGGACAAACAGATCCTGTCTGAAAAAATCCTGTTCCTGGACGAAATCAAAAACGACACCGAACTGATCGAAGCGCTGTCTCAGTTCGCGAAAACCGCGGAAGAAAAAACCAAAATCGTTAAAAAACTGTTCGCGGACTTCGTTGAAAACAACTCTAAATACGACCTGGCGCAGATCTACATCTCTCAGGAAGCGTTCAACACCATCTCTAACAAATGGACCTCTGAAACCGAAACCTTCGCGAAATACCTGTTCGAAGCGATGAAATCTGGTAAACTGGCGAAATACGAAAAAAAAGACAACTCTTACAAATTCCCGGACTTCATCGCGCTGTCTCAGATGAAATCTGCGCTGCTGTCTATCTCTCTGGAAGGTCACTTCTGGAAAGAAAAATACTACAAAATCTCTAAATTCCAGGAAAAAACCAACTGGGAACAGTTCCTGGCGATCTTCCTGTACGAATTCAACTCTCTGTTCTCTGACAAAATCAACACCAAAGACGGTGAAACCAAACAGGTTGGTTACTACCTGTTCGCGAAAGACCTGCACAACCTGATCCTGTCTGAACAGATCGACATCCCGAAAGACTCTAAAGTTACCATCAAAGACTTCGCGGACTCTGTTCTGACCATCTACCAGATGGCGAAATACTTCGCGGTTGAAAAAAAACGTGCGTGGCTGGCGGAATACGAACTGGACTCTTTCTACACCCAGCCGGACACCGGTTACCTGCAGTTCTACGACAACGCGTACGAAGACATCGTTCAGGTTTACAACAAACTGCGTAACTACCTGACCAAAAAACCGTACTCTGAAGAAAAATGGAAACTGAACTTCGAAAACTCTACCCTGGCGAACGGTTGGGACAAA
AACAAAGAATCTGACAACTCTGCGGTTATCCTGCAGAAAGGTGGTAAATACTACCTGGGTCTGATCACCAAAGGTCACAACAAAATCTTCGACGACCGTTTCCAGGAAAAATTCATCGTTGGTATCGAAGGTGGTAAATACGAAAAAATCGTTTACAAATTCTTCCCGGACCAGGCGAAAATGTTCCCGAAAGTTTGCTTCTCTGCGAAAGGTCTGGAATTCTTCCGTCCGTCTGAAGAAATCCTGCGTATCTACAACAACGCGGAATTCAAAAAAGGTGAAACCTACTCTATCGACTCTATGCAGAAACTGATCGACTTCTACAAAGACTGCCTGACCAAATACGAAGGTTGGGCGTGCTACACCTTCCGTCACCTGAAACCGACCGAAGAATACCAGAACAACATCGGTGAATTCTTCCGTGACGTTGCGGAAGACGGTTACCGTATCGACTTCCAGGGTATCTCTGACCAGTACATCCACGAAAAAAACGAAAAAGGTGAACTGCACCTGTTCGAAATCCACAACAAAGACTGGAACCTGGACAAAGCGCGTGACGGTAAATCTAAAACCACCCAGAAAAACCTGCACACCCTGTACTTCGAATCTCTGTTCTCTAACGACAACGTTGTTCAGAACTTCCCGATCAAACTGAACGGTCAGGCGGAAATCTTCTACCGTCCGAAAACCGAAAAAGACAAACTGGAATCTAAAAAAGACAAAAAAGGTAACAAAGTTATCGACCACAAACGTTACTCTGAAAACAAAATCTTCTTCCACGTTCCGCTGACCCTGAACCGTACCAAAAACGACTCTTACCGTTTCAACGCGCAGATCAACAACTTCCTGGCGAACAACAAAGACATCAACATCATCGGTGTTGACCGTGGTGAAAAACACCTGGTTTACTACTCTGTTATCACCCAGGCGTCTGACATCCTGGAATCTGGTTCTCTGAACGAACTGAACGGTGTTAACTACGCGGAAAAACTGGGTAAAAAAGCGGAAAACCGTGAACAGGCGCGTCGTGACTGGCAGGACGTTCAGGGTATCAAAGACCTGAAAAAAGGTTACATCTCTCAGGTTGTTCGTAAACTGGCGGACCTGGCGATCAAACACAACGCGATCATCATCCTGGAAGACCTGAACATGCGTTTCAAACAGGTTCGTGGTGGTATCGAAAAATCTATCTACCAGCAGCTGGAAAAAGCGCTGATCGACAAACTGTCTTTCCTGGTTGACAAAGGTGAAAAAAACCCGGAACAGGCGGGTCACCTGCTGAAAGCGTACCAGCTGTCTGCGCCGTTCGAAACCTTCCAGAAAATGGGTAAACAGACCGGTATCATCTTCTACACCCAGGCGTCTTACACCTCTAAATCTGACCCGGTTACCGGTTGGCGTCCGCACCTGTACCTGAAATACTTCTCTGCGAAAAAAGCGAAAGACGACATCGCGAAATTCACCAAAATCGAATTCGTTAACGACCGTTTCGAACTGACCTACGACATCAAAGACTTCCAGCAGGCGAAAGAATACCCGAACAAAACCGTTTGGAAAGTTTGCTCTAACGTTGAACGTTTCCGTTGGGACAAAAACCTGAACCAGAACAAAGGTGGTTACACCCACTACACCAACATCACCGAAAACATCCAGGAACTGTTCACCAAATACGGTATCGACATCACCAAAGACCTGCTGACCCAGATCTCTACCATCGACGAAAAACAGAACACCTCTTTCTTCCGTGACTTCATCTTCTACTTCAACCTGATCTGCCAGATCCGTAACACCGACGACTCTGAAATCGCGAAAAAAAACGGTAAAGACGACTTCATCCTGTCTCCGGTTGAACCGTTCTTCGACTCTCGTAAAGACAACGGTAACAAACTGCCGGAAAACGGTGACGACAACGGTGCGTACAACATCGCGCGTAAAGGTATCGTTATCCTGAACAAAATCTCTCAGTACTCTGAAAAAAACGAAAACTGCGAAAAAATGAA
ATGGGGTGACCTGTACGTTTCTAACATCGACTGGGACAACTTCGTTGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATGTAGGGAGACCCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACA SEQAATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGTTGCCGTCACTGCGTCIDTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGCTTATTAAAAGCATTCTGTAACAAAGCGGGANO:CCAAAGCCATGACAAAAACGCGTAACAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGATTATT 71TGCACGGCGTCACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTGACGCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAGTAATACGACTCACTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAACTTTAAGAGGAGGATATACCATGCACCATCATCATCACCATAACAAATTCGAAAACTTCACCGGTCTGTACCCGATCTCTAAAACCCTGCGTTTCGAACTGATCCCGCAGGGTAAAACCCTGGAATACATCGAAAAATCTGAAATCCTGGAAAACGACAACTACCGTGCGGAAAAATACGAAGAAGTTAAAGACATCATCGACGGTTACCACAAATGGTTCATCAACGAAACCCTGCACGACCTGCACATCAACTGGTCTGAACTGAAAGTTGCGCTGGAAAACAACCGTATCGAAAAATCTGACGCGTCTAAAAAAGAACTGCAGCGTGTTCAGAAAATCAAACGTGAAGAAATCTACAACGCGTTCATCGAACACGAAGCGTTCCAGTACCTGTTCAAAGAAAACCTGCTGTCTGACCTGCTGCCGATCCAGATCGAACAGTCTGAAGACCTGGACGCGGAAAAAAAAAAACAGGCGGTTGAAACCTTCAACCGTTTCTCTACCTACTTCACCGGTTTCCACGAAAACCGTAAAAACATCTACTCTAAAGAAGGTATCTCTACCTCTGTTACCTACCGTATCGTTCACGACAACTTCCCGAAATTCCTGGAAAACATGAAAGTTTTCGAAATCCTGCGTAACGAATGCCCGGAAGTTATCTCTGACACCGCGAACGAACTGGCGCCGTTCATCGACGGTGTTCGTATCGAAGACATCTTCCTGATCGACTTCTTCAACTCTACCTTCTCTCAGAACGGTATCGACTACTACAACCGTATCCTGGGTGGTGTTACCACCGAAACCGGTGAAAAATACCGTGGTATCAACGAATTCACCAACCTGTACCGTCAGCAGCACCCGGAATTCGGTAAATCTAAAAAAGCGACCAAAATGGTTGTTCTGTTCAAACAGATCCTGTCTGACCGTGACACCCTGTCTTTCATCCCGGAAATGTTCGGTAACGACAAACAGGTTCAGAACTCTATCCAGCTGTTCTACAACCGTGAAATCTCTCAGTTCGAAAACGAAGGTGTTAAAACCGACGTTTGCACCGCGCTGGCGACCCTGACCTCTAAAATCGCGGAATTCGACACCGAAAAAATCTACATCCAGCAGCCGGAACTGCCGAACGTTTCTCAGCGTCTGTTCGGTTCTTGGAACGAACTGAACGCGTGCCTGTTCAAATACGCGGAACTGAAATTCGGTACCGCGGAAAAAGTTGCGAACCGTAAAAAAATCGACAAATGGCTGAAATCTGACCTGTTCTCTTTCACCGAACTGAACAAAGCGCTGGAATTCTCTGGTAAAGACGAACGTATCGAAAACTACTTCTCTGAAACCGGTATCTTCGCGCAGCTGGTTAAAACCGGTTTCGACGAAGCGCAGTCTATCCTGGAAACCGAATACACCTCTGAAGTTCACCTGAAAGACCAGCAGACCGACATC
GAAAAAATCAAAACCTTCCTGGACGCGCTGCAGAACCTGATGCACCTGCTGAAATCTCTGTGCGTTTCTGAAGAAGCGGACCGTGACGCGGCGTTCTACAACGAATTCGACATGCTGTACAACCAGCTGAAACTGGTTGTTCCGCTGTACAACAAAGTTCGTAACTACATCACCCAGAAACTGTTCCGTTCTGACAAAATCAAAATCTACTTCGAAAACAAAGGTCAGTTCCTGGGTGGTTGGGTTGACTCTCAGACCGAAAACTCTGACAACGGTACCCAGGCGGGTGGTTACATCTTCCGTAAAGAAAACGTTATCAACGAATACGACTACTACCTGGGTATCTGCTCTGACCCGAAACTGTTCCGTCGTACCACCATCGTTTCTGAAAACGACCGTTCTTCTTTCGAACGTCTGGACTACTACCAGCTGAAAACCGCGTCTGTTTACGGTAACTCTTACTGCGGTAAACACCCGTACACCGAAGACAAAAACGAACTGGTTAACTCTATCGACCGTTTCGTTCACCTGTCTGGTAACAACATCCTGATCGAAAAAATCGCGAAAGACAAAGTTAAATCTAACCCGACCACCAACACCCCGTCTGGTTACCTGAACTTCATCCACCGTGAAGCGCCGAACACCTACGAATGCCTGCTGCAGGACGAAAACTTCGTTTCTCTGAACCAGCGTGTTGTTTCTGCGCTGAAAGCGACCCTGGCGACCCTGGTTCGTGTTCCGAAAGCGCTGGTTTACGCGAAAAAAGACTACCACCTGTTCTCTGAAATCATCAACGACATCGACGAACTGTCTTACGAAAAAGCGTTCTCTTACTTCCCGGTTTCTCAGACCGAATTCGAAAACTCTTCTAACCGTACCATCAAACCGCTGCTGCTGTTCAAAATCTCTAACAAAGACCTGTCTTTCGCGGAAAACTTCGAAAAAGGTAACCGTCAGAAAATCGGTAAAAAAAACCTGCACACCCTGTACTTCGAAGCGCTGATGAAAGGTAACCAGGACACCATCGACATCGGTACCGGTATGGTTTTCCACCGTGTTAAATCTCTGAACTACAACGAAAAAACCCTGAAATACGGTCACCACTCTACCCAGCTGAACGAAAAATTCTCTTACCCGATCATCAAAGACAAACGTTTCGCGTCTGACAAATTCCTGTTCCACCTGTCTACCGAAATCAACTACAAAGAAAAACGTAAACCGCTGAACAACTCTATCATCGAATTCCTGACCAACAACCCGGACATCAACATCATCGGTCTGGACCGTGGTGAACGTCACCTGATCTACCTGACCCTGATCAACCAGAAAGGTGAAATCCTGCGTCAGAAAACCTTCAACATCGTTGGTAACACCAACTACCACGAAAAACTGAACCAGCGTGAAAAAGAACGTGACAACGCGCGTAAATCTTGGGCGACCATCGGTAAAATCAAAGAACTGAAAGAAGGTTTCCTGTCTCTGGTTATCCACGAAATCGCGAAAATCATGGTTGAAAACAACGCGATCGTTGTTCTGGAAGACCTGAACTTCGGTTTCAAACGTGGTCGTTTCAAAGTTGAAAAACAGATCTACCAGAAATTCGAAAAAATGCTGATCGACAAACTGAACTACCTGGTTTTCAAAGACAAAAAAGCGAACGAAGCGGGTGGTGTTCTGAAAGGTTACCAGCTGGCGGAAAAATTCGAATCTTTCCAGAAAATGGGTAAACAGTCTGGTTTCCTGTTCTACGTTCCGGCGGCGTACACCTCTAAAATCGACCCGACCACCGGTTTCGTTAACATGCTGAACCTGAACTACACCAACATGAAAGACGCGCAGACCCTGCTGTCTGGTATGGACAAAATCTCTTTCAACGCGGACGCGAACTACTTCGAATTCGAACTGGACTACGAAAAATTCAAAACCAACCAGACCGACCACACCAACAAATGGACCATCTGCACCGTTGGTGAAAAACGTTTCACCTACAACTCTGCGAC
CAAAGAAACCACCACCGTTAACGTTACCGAAGACCTGAAAAAACTGCTGGACAAATTCGAAGTTAAATACTCTAACGGTGACAACATCAAAGACGAAATCTGCCGTCAGACCGACGCGAAATTCTTCGAAATCATCCTGTGGCTGCTGAAACTGACCATGCAGATGCGTAACTCTAACACCAAAACCGAAGAAGACTTCATCCTGTCTCCGGTTAAAAACTCTAACGGTGAATTCTTCCGTTCTAACGACGACGCGAACGGTATCTGGCCGGCGGACGCGGACGCGAACGGTGCGTACCACATCGCGCTGAAAGGTCTGTACCTGGTTAAAGAATGCTTCAACAAAAACGAAAAATCTCTGAAAATCGAACACAAAAACTGGTTCAAATTCGCGCAGACCCGTTTCAACGGTTCTCTGACCAAAAACGGTTAAGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATGTAGGGAGACCCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACA SEQAATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGTTGCCGTCACTGCGTCIDTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGCTTATTAAAAGCATTCTGTAACAAAGCGGGANO:CCAAAGCCATGACAAAAACGCGTAACAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGATTATT 72TGCACGGCGTCACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTGACGCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAGTAATACGACTCACTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAACTTTAAGAGGAGGATATACCATGCACCATCATCATCACCATACCCAGTTCGAAGGTTTCACCAACCTGTACCAGGTTTCTAAAACCCTGCGTTTCGAACTGATCCCGCAGGGTAAAACCCTGAAACACATCCAGGAACAGGGTTTCATCGAAGAAGACAAAGCGCGTAACGACCACTACAAAGAACTGAAACCGATCATCGACCGTATCTACAAAACCTACGCGGACCAGTGCCTGCAGCTGGTTCAGCTGGACTGGGAAAACCTGTCTGCGGCGATCGACTCTTACCGTAAAGAAAAAACCGAAGAAACCCGTAACGCGCTGATCGAAGAACAGGCGACCTACCGTAACGCGATCCACGACTACTTCATCGGTCGTACCGACAACCTGACCGACGCGATCAACAAACGTCACGCGGAAATCTACAAAGGTCTGTTCAAAGCGGAACTGTTCAACGGTAAAGTTCTGAAACAGCTGGGTACCGTTACCACCACCGAACACGAAAACGCGCTGCTGCGTTCTTTCGACAAATTCACCACCTACTTCTCTGGTTTCTACGAAAACCGTAAAAACGTTTTCTCTGCGGAAGACATCTCTACCGCGATCCCGCACCGTATCGTTCAGGACAACTTCCCGAAATTCAAAGAAAACTGCCACATCTTCACCCGTCTGATCACCGCGGTTCCGTCTCTGCGTGAACACTTCGAAAACGTTAAAAAAGCGATCGGTATCTTCGTTTCTACCTCTATCGAAGAAGTTTTCTCTTTCCCGTTCTACAACCAGCTGCTGACCCAGACCCAGATCGACCTGTACAACCAGCTGCTGGGTGGTATCTCTCGTGAAGCGGGTACCGAAAAAATCAAAGGTCTGAACGAAGTTCTGAACCTGGCGATCCAGAAAAACGACGAAACCGCGCACATCATCGCGTCTCTGCCGCACCGTTTCATCCCGCTGTTCAAACAGATCCTGTCTGACCGTAACACCCTGTCTTTCATCCTGGAAGAATTCAAATCTGACGAAGAAGTTATCCAGTCTTTCTGCAAATACAAAACCCTGCTGCGT
AACGAAAACGTTCTGGAAACCGCGGAAGCGCTGTTCAACGAACTGAACTCTATCGACCTGACCCACATCTTCATCTCTCACAAAAAACTGGAAACCATCTCTTCTGCGCTGTGCGACCACTGGGACACCCTGCGTAACGCGCTGTACGAACGTCGTATCTCTGAACTGACCGGTAAAATCACCAAATCTGCGAAAGAAAAAGTTCAGCGTTCTCTGAAACACGAAGACATCAACCTGCAGGAAATCATCTCTGCGGCGGGTAAAGAACTGTCTGAAGCGTTCAAACAGAAAACCTCTGAAATCCTGTCTCACGCGCACGCGGCGCTGGACCAGCCGCTGCCGACCACCCTGAAAAAACAGGAAGAAAAAGAAATCCTGAAATCTCAGCTGGACTCTCTGCTGGGTCTGTACCACCTGCTGGACTGGTTCGCGGTTGACGAATCTAACGAAGTTGACCCGGAATTCTCTGCGCGTCTGACCGGTATCAAACTGGAAATGGAACCGTCTCTGTCTTTCTACAACAAAGCGCGTAACTACGCGACCAAAAAACCGTACTCTGTTGAAAAATTCAAACTGAACTTCCAGATGCCGACCCTGGCGTCTGGTTGGGACGTTAACAAAGAAAAAAACAACGGTGCGATCCTGTTCGTTAAAAACGGTCTGTACTACCTGGGTATCATGCCGAAACAGAAAGGTCGTTACAAAGCGCTGTCTTTCGAACCGACCGAAAAAACCTCTGAAGGTTTCGACAAAATGTACTACGACTACTTCCCGGACGCGGCGAAAATGATCCCGAAATGCTCTACCCAGCTGAAAGCGGTTACCGCGCACTTCCAGACCCACACCACCCCGATCCTGCTGTCTAACAACTTCATCGAACCGCTGGAAATCACCAAAGAAATCTACGACCTGAACAACCCGGAAAAAGAACCGAAAAAATTCCAGACCGCGTACGCGAAAAAAACCGGTGACCAGAAAGGTTACCGTGAAGCGCTGTGCAAATGGATCGACTTCACCCGTGACTTCCTGTCTAAATACACCAAAACCACCTCTATCGACCTGTCTTCTCTGCGTCCGTCTTCTCAGTACAAAGACCTGGGTGAATACTACGCGGAACTGAACCCGCTGCTGTACCACATCTCTTTCCAGCGTATCGCGGAAAAAGAAATCATGGACGCGGTTGAAACCGGTAAACTGTACCTGTTCCAGATCTACAACAAAGACTTCGCGAAAGGTCACCACGGTAAACCGAACCTGCACACCCTGTACTGGACCGGTCTGTTCTCTCCGGAAAACCTGGCGAAAACCTCTATCAAACTGAACGGTCAGGCGGAACTGTTCTACCGTCCGAAATCTCGTATGAAACGTATGGCGCACCGTCTGGGTGAAAAAATGCTGAACAAAAAACTGAAAGACCAGAAAACCCCGATCCCGGACACCCTGTACCAGGAACTGTACGACTACGTTAACCACCGTCTGTCTCACGACCTGTCTGACGAAGCGCGTGCGCTGCTGCCGAACGTTATCACCAAAGAAGTTTCTCACGAAATCATCAAAGACCGTCGTTTCACCTCTGACAAATTCTTCTTCCACGTTCCGATCACCCTGAACTACCAGGCGGCGAACTCTCCGTCTAAATTCAACCAGCGTGTTAACGCGTACCTGAAAGAACACCCGGAAACCCCGATCATCGGTATCGACCGTGGTGAACGTAACCTGATCTACATCACCGTTATCGACTCTACCGGTAAAATCCTGGAACAGCGTTCTCTGAACACCATCCAGCAGTTCGACTACCAGAAAAAACTGGACAACCGTGAAAAAGAACGTGTTGCGGCGCGTCAGGCGTGGTCTGTTGTTGGTACCATCAAAGACCTGAAACAGGGTTACCTGTCTCAGGTTATCCACGAAATCGTTGACCTGATGATCCACTACCAGGCGGTTGTTGTTCTGGAAAACCTGAACTTCGGTTTCAAATCTAAACGTACCGGTATCGCGGA
AAAAGCGGTTTACCAGCAGTTCGAAAAAATGCTGATCGACAAACTGAACTGCCTGGTTCTGAAAGACTACCCGGCGGAAAAAGTTGGTGGTGTTCTGAACCCGTACCAGCTGACCGACCAGTTCACCTCTTTCGCGAAAATGGGTACCCAGTCTGGTTTCCTGTTCTACGTTCCGGCGCCGTACACCTCTAAAATCGACCCGCTGACCGGTTTCGTTGACCCGTTCGTTTGGAAAACCATCAAAAACCACGAATCTCGTAAACACTTCCTGGAAGGTTTCGACTTCCTGCACTACGACGTTAAAACCGGTGACTTCATCCTGCACTTCAAAATGAACCGTAACCTGTCTTTCCAGCGTGGTCTGCCGGGTTTCATGCCGGCGTGGGACATCGTTTTCGAAAAAAACGAAACCCAGTTCGACGCGAAAGGTACCCCGTTCATCGCGGGTAAACGTATCGTTCCGGTTATCGAAAACCACCGTTTCACCGGTCGTTACCGTGACCTGTACCCGGCGAACGAACTGATCGCGCTGCTGGAAGAAAAAGGTATCGTTTTCCGTGACGGTTCTAACATCCTGCCGAAACTGCTGGAAAACGACGACTCTCACGCGATCGACACCATGGTTGCGCTGATCCGTTCTGTTCTGCAGATGCGTAACTCTAACGCGGCGACCGGTGAAGACTACATCAACTCTCCGGTTCGTGACCTGAACGGTGTTTGCTTCGACTCTCGTTTCCAGAACCCGGAATGGCCGATGGACGCGGACGCGAACGGTGCGTACCACATCGCGCTGAAAGGTCAGCTGCTGCTGAACCACCTGAAAGAATCTAAAGACCTGAAACTGCAGAACGGTATCTCTAACCAGGACTGGCTGGCGTACATCCAGGAACTGCGTAACTAGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATGTAGGGAGACCCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACA SEQAATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGTTGCCGTCACTGCGTCIDTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGCTTATTAAAAGCATTCTGTAACAAAGCGGGANO:CCAAAGCCATGACAAAAACGCGTAACAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGATTATT 
73TGCACGGCGTCACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTGACGCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAGTAATACGACTCACTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAACTTTAAGAGGAGGATATACCATGCACCATCATCATCACCATGCGGTTAAATCTATCAAAGTTAAACTGCGTCTGGACGACATGCCGGAAATCCGTGCGGGTCTGTGGAAACTGCACAAAGAAGTTAACGCGGGTGTTCGTTACTACACCGAATGGCTGTCTCTGCTGCGTCAGGAAAACCTGTACCGTCGTTCTCCGAACGGTGACGGTGAACAGGAATGCGACAAAACCGCGGAAGAATGCAAAGCGGAACTGCTGGAACGTCTGCGTGCGCGTCAGGTTGAAAACGGTCACCGTGGTCCGGCGGGTTCTGACGACGAACTGCTGCAGCTGGCGCGTCAGCTGTACGAACTGCTGGTTCCGCAGGCGATCGGTGCGAAAGGTGACGCGCAGCAGATCGCGCGTAAATTCCTGTCTCCGCTGGCGGACAAAGACGCGGTTGGTGGTCTGGGTATCGCGAAAGCGGGTAACAAACCGCGTTGGGTTCGTATGCGTGAAGCGGGTGAACCGGGTTGGGAAGAAGAAAAAGAAAAAGCGGAAACCCGTAAATCTGCGGACCGTACCGCGGACGTTCTGCGTGCGCTGGCGGACTTCGGTCTGAAACCGCTGATGCGTGTTTACACCGACTCTGAAATGTCTTCTGTTGAATGGAAACCGCTGCGTAAAGGTCAGGCGGTTCGTACCTGGGACCGTGACATGTTCCAGCAGGCGATCGAACGTATGATGTCTTGGGAATCTTGGAACCAGCGTGTTGGTCAGGAATACGCGAAACTGGTTGAACAGAAAAACCGTTTCGAACAGAAAAACTTCGTTGGTCAGGAACACCTGGTTCACCTGGTTAACCAGCTGCAGCAGGACATGAAAGAAGCGTCTCCGGGTCTGGAATCTAAAGAACAGACCGCGCACTACGTTACCGGTCGTGCGCTGCGTGGTTCTGACAAAGTTTTCGAAAAATGGGGTAAACTGGCGCCGGACGCGCCGTTCGACCTGTACGACGCGGAAATCAAAAACGTTCAGCGTCGTAACACCCGTCGTTTCGGTTCTCACGACCTGTTCGCGAAACTGGCGGAACCGGAATACCAGGCGCTGTGGCGTGAAGACGCGTCTTTCCTGACCCGTTACGCGGTTTACAACTCTATCCTGCGTAAACTGAACCACGCGAAAATGTTCGCGACCTTCACCCTGCCGGACGCGACCGCGCACCCGATCTGGACCCGTTTCGACAAACTGGGTGGTAACCTGCACCAGTACACCTTCCTGTTCAACGAATTCGGTGAACGTCGTCACGCGATCCGTTTCCACAAACTGCTGAAAGTTGAAAACGGTGTTGCGCGTGAAGTTGACGACGTTACCGTTCCGATCTCTATGTCTGAACAGCTGGACAACCTGCTGCCGCGTGACCCGAACGAACCGATCGCGCTGTACTTCCGTGACTACGGTGCGGAACAGCACTTCACCGGTGAATTCGGTGGTGCGAAAATCCAGTGCCGTCGTGACCAGCTGGCGCACATGCACCGTCGTCGTGGTGCGCGTGACGTTTACCTGAACGTTTCTGTTCGTGTTCAGTCTCAGTCTGAAGCGCGTGGTGAACGTCGTCCGCCGTACGCGGCGGTTTTCCGTCTGGTTGGTGACAACCACCGTGCGTTCGTTCACTTCGACAAACTGTCTGACTACCTGGCGGAACACCCGGACGACGGTAAACTGGGTTCTGAAGGTCTGCTGTCTGGTCTGCGTGTTATGTCTGTTGACCTGGGTCTGCGTACCTCTGCGTCTATCTCTGTTTTCCGTGTTGCGCGTAAAGACGAACTGAAACCGAACTCTAAA
GGTCGTGTTCCGTTCTTCTTCCCGATCAAAGGTAACGACAACCTGGTTGCGGTTCACGAACGTTCTCAGCTGCTGAAACTGCCGGGTGAAACCGAATCTAAAGACCTGCGTGCGATCCGTGAAGAACGTCAGCGTACCCTGCGTCAGCTGCGTACCCAGCTGGCGTACCTGCGTCTGCTGGTTCGTTGCGGTTCTGAAGACGTTGGTCGTCGTGAACGTTCTTGGGCGAAACTGATCGAACAGCCGGTTGACGCGGCGAACCACATGACCCCGGACTGGCGTGAAGCGTTCGAAAACGAACTGCAGAAACTGAAATCTCTGCACGGTATCTGCTCTGACAAAGAATGGATGGACGCGGTTTACGAATCTGTTCGTCGTGTTTGGCGTCACATGGGTAAACAGGTTCGTGACTGGCGTAAAGACGTTCGTTCTGGTGAACGTCCGAAAATCCGTGGTTACGCGAAAGACGTTGTTGGTGGTAACTCTATCGAACAGATCGAATACCTGGAACGTCAGTACAAATTCCTGAAATCTTGGTCTTTCTTCGGTAAAGTTTCTGGTCAGGTTATCCGTGCGGAAAAAGGTTCTCGTTTCGCGATCACCCTGCGTGAACACATCGACCACGCGAAAGAAGACCGTCTGAAAAAACTGGCGGACCGTATCATCATGGAAGCGCTGGGTTACGTTTACGCGCTGGACGAACGTGGTAAAGGTAAATGGGTTGCGAAATACCCGCCGTGCCAGCTGATCCTGCTGGAAGAACTGTCTGAATACCAGTTCAACAACGACCGTCCGCCGTCTGAAAACAACCAGCTGATGCAGTGGTCTCACCGTGGTGTTTTCCAGGAACTGATCAACCAGGCGCAGGTTCACGACCTGCTGGTTGGTACCATGTACGCGGCGTTCTCTTCTCGTTTCGACGCGCGTACCGGTGCGCCGGGTATCCGTTGCCGTCGTGTTCCGGCGCGTTGCACCCAGGAACACAACCCGGAACCGTTCCCGTGGTGGCTGAACAAATTCGTTGTTGAACACACCCTGGACGCGTGCCCGCTGCGTGCGGACGACCTGATCCCGACCGGTGAAGGTGAAATCTTCGTTTCTCCGTTCTCTGCGGAAGAAGGTGACTTCCACCAGATCCACGCGGACCTGAACGCGGCGCAGAACCTGCAGCAGCGTCTGTGGTCTGACTTCGACATCTCTCAGATCCGTCTGCGTTGCGACTGGGGTGAAGTTGACGGTGAACTGGTTCTGATCCCGCGTCTGACCGGTAAACGTACCGCGGACTCTTACTCTAACAAAGTTTTCTACACCAACACCGGTGTTACCTACTACGAACGTGAACGTGGTAAAAAACGTCGTAAAGTTTTCGCGCAGGAAAAACTGTCTGAAGAAGAAGCGGAACTGCTGGTTGAAGCGGACGAAGCGCGTGAAAAATCTGTTGTTCTGATGCGTGACCCGTCTGGTATCATCAACCGTGGTAACTGGACCCGTCAGAAAGAATTCTGGTCTATGGTTAACCAGCGTATCGAAGGTTACCTGGTTAAACAGATCCGTTCTCGTGTTCCGCTGCAGGACTCTGCGTGCGAAAACACCGGTGACATCTAAGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATGTAGGGAGACCCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACA SEQAATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGTTGCCGTCACTGCGTCIDTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGCTTATTAAAAGCATTCTGTAACAAAGCGGGANO:CCAAAGCCATGACAAAAACGCGTAACAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGATTATT 
74TGCACGGCGTCACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTGACGCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAGTAATACGACTCACTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAACTTTAAGAGGAGGATATACCATGCACCATCATCATCACCATGCGACCCGTTCTTTCATCCTGAAAATCGAACCGAACGAAGAAGTTAAAAAAGGTCTGTGGAAAACCCACGAAGTTCTGAACCACGGTATCGCGTACTACATGAACATCCTGAAACTGATCCGTCAGGAAGCGATCTACGAACACCACGAACAGGACCCGAAAAACCCGAAAAAAGTTTCTAAAGCGGAAATCCAGGCGGAACTGTGGGACTTCGTTCTGAAAATGCAGAAATGCAACTCTTTCACCCACGAAGTTGACAAAGACGTTGTTTTCAACATCCTGCGTGAACTGTACGAAGAACTGGTTCCGTCTTCTGTTGAAAAAAAAGGTGAAGCGAACCAGCTGTCTAACAAATTCCTGTACCCGCTGGTTGACCCGAACTCTCAGTCTGGTAAAGGTACCGCGTCTTCTGGTCGTAAACCGCGTTGGTACAACCTGAAAATCGCGGGTGACCCGTCTTGGGAAGAAGAAAAAAAAAAATGGGAAGAAGACAAAAAAAAAGACCCGCTGGCGAAAATCCTGGGTAAACTGGCGGAATACGGTCTGATCCCGCTGTTCATCCCGTTCACCGACTCTAACGAACCGATCGTTAAAGAAATCAAATGGATGGAAAAATCTCGTAACCAGTCTGTTCGTCGTCTGGACAAAGACATGTTCATCCAGGCGCTGGAACGTTTCCTGTCTTGGGAATCTTGGAACCTGAAAGTTAAAGAAGAATACGAAAAAGTTGAAAAAGAACACAAAACCCTGGAAGAACGTATCAAAGAAGACATCCAGGCGTTCAAATCTCTGGAACAGTACGAAAAAGAACGTCAGGAACAGCTGCTGCGTGACACCCTGAACACCAACGAATACCGTCTGTCTAAACGTGGTCTGCGTGGTTGGCGTGAAATCATCCAGAAATGGCTGAAAATGGACGAAAACGAACCGTCTGAAAAATACCTGGAAGTTTTCAAAGACTACCAGCGTAAACACCCGCGTGAAGCGGGTGACTACTCTGTTTACGAATTCCTGTCTAAAAAAGAAAACCACTTCATCTGGCGTAACCACCCGGAATACCCGTACCTGTACGCGACCTTCTGCGAAATCGACAAAAAAAAAAAAGACGCGAAACAGCAGGCGACCTTCACCCTGGCGGACCCGATCAACCACCCGCTGTGGGTTCGTTTCGAAGAACGTTCTGGTTCTAACCTGAACAAATACCGTATCCTGACCGAACAGCTGCACACCGAAAAACTGAAAAAAAAACTGACCGTTCAGCTGGACCGTCTGATCTACCCGACCGAATCTGGTGGTTGGGAAGAAAAAGGTAAAGTTGACATCGTTCTGCTGCCGTCTCGTCAGTTCTACAACCAGATCTTCCTGGACATCGAAGAAAAAGGTAAACACGCGTTCACCTACAAAGACGAATCTATCAAATTCCCGCTGAAAGGTACCCTGGGTGGTGCGCGTGTTCAGTTCGACCGTGACCACCTGCGTCGTTACCCGCACAAAGTTGAATCTGGTAACGTTGGTCGTATCTACTTCAACATGACCGTTAACATCGAACCGACCGAATCTCCGGTTTCTAAATCTCTGAAAATCCACCGTGACGACTTCCCGAAATTCGTTAACTTCAAACCGAAAGAACTGACCGAATGGATCAAAGACTCTAAAGGTAAAAAACTGAAATCTGGTATCGAATCTCTGGAAATCGGTCTGCGTGTTATGTCTATCGACCTGGGTCAGCGTCAGGCGGCGGCGGCGTCTATCTTCGAAGTTGTTGACCAGAAACCGGACATC
GAAGGTAAACTGTTCTTCCCGATCAAAGGTACCGAACTGTACGCGGTTCACCGTGCGTCTTTCAACATCAAACTGCCGGGTGAAACCCTGGTTAAATCTCGTGAAGTTCTGCGTAAAGCGCGTGAAGACAACCTGAAACTGATGAACCAGAAACTGAACTTCCTGCGTAACGTTCTGCACTTCCAGCAGTTCGAAGACATCACCGAACGTGAAAAACGTGTTACCAAATGGATCTCTCGTCAGGAAAACTCTGACGTTCCGCTGGTTTACCAGGACGAACTGATCCAGATCCGTGAACTGATGTACAAACCGTACAAAGACTGGGTTGCGTTCCTGAAACAGCTGCACAAACGTCTGGAAGTTGAAATCGGTAAAGAAGTTAAACACTGGCGTAAATCTCTGTCTGACGGTCGTAAAGGTCTGTACGGTATCTCTCTGAAAAACATCGACGAAATCGACCGTACCCGTAAATTCCTGCTGCGTTGGTCTCTGCGTCCGACCGAACCGGGTGAAGTTCGTCGTCTGGAACCGGGTCAGCGTTTCGCGATCGACCAGCTGAACCACCTGAACGCGCTGAAAGAAGACCGTCTGAAAAAAATGGCGAACACCATCATCATGCACGCGCTGGGTTACTGCTACGACGTTCGTAAAAAAAAATGGCAGGCGAAAAACCCGGCGTGCCAGATCATCCTGTTCGAAGACCTGTCTAACTACAACCCGTACGAAGAACGTTCTCGTTTCGAAAACTCTAAACTGATGAAATGGTCTCGTCGTGAAATCCCGCGTCAGGTTGCGCTGCAGGGTGAAATCTACGGTCTGCAGGTTGGTGAAGTTGGTGCGCAGTTCTCTTCTCGTTTCCACGCGAAAACCGGTTCTCCGGGTATCCGTTGCTCTGTTGTTACCAAAGAAAAACTGCAGGACAACCGTTTCTTCAAAAACCTGCAGCGTGAAGGTCGTCTGACCCTGGACAAAATCGCGGTTCTGAAAGAAGGTGACCTGTACCCGGACAAAGGTGGTGAAAAATTCATCTCTCTGTCTAAAGACCGTAAACTGGTTACCACCCACGCGGACATCAACGCGGCGCAGAACCTGCAGAAACGTTTCTGGACCCGTACCCACGGTTTCTACAAAGTTTACTGCAAAGCGTACCAGGTTGACGGTCAGACCGTTTACATCCCGGAATCTAAAGACCAGAAACAGAAAATCATCGAAGAATTCGGTGAAGGTTACTTCATCCTGAAAGACGGTGTTTACGAATGGGGTAACGCGGGTAAACTGAAAATCAAAAAAGGTTCTTCTAAACAGTCTTCTTCTGAACTGGTTGACTCTGACATCCTGAAAGACTCTTTCGACCTGGCGTCTGAACTGAAAGGTGAAAAACTGATGCTGTACCGTGACCCGTCTGGTAACGTTTTCCCGTCTGACAAATGGATGGCGGCGGGTGTTTTCTTCGGTAAACTGGAACGTATCCTGATCTCTAAACTGACCAACCAGTACTCTATCTCTACCATCGAAGACGACTCTTCTAAACAGTCTATGTAAGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATGTAGGGAGACCCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACA SEQAATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGTTGCCGTCACTGCGTCIDTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGCTTATTAAAAGCATTCTGTAACAAAGCGGGANO:CCAAAGCCATGACAAAAACGCGTAACAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGATTATT 
75TGCACGGCGTCACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTGACGCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAGTAATACGACTCACTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAACTTTAAGAGGAGGATATACCATGCACCATCATCATCACCATCCGACCCGTACCATCAACCTGAAACTGGTTCTGGGTAAAAACCCGGAAAACGCGACCCTGCGTCGTGCGCTGTTCTCTACCCACCGTCTGGTTAACCAGGCGACCAAACGTATCGAAGAATTCCTGCTGCTGTGCCGTGGTGAAGCGTACCGTACCGTTGACAACGAAGGTAAAGAAGCGGAAATCCCGCGTCACGCGGTTCAGGAAGAAGCGCTGGCGTTCGCGAAAGCGGCGCAGCGTCACAACGGTTGCATCTCTACCTACGAAGACCAGGAAATCCTGGACGTTCTGCGTCAGCTGTACGAACGTCTGGTTCCGTCTGTTAACGAAAACAACGAAGCGGGTGACGCGCAGGCGGCGAACGCGTGGGTTTCTCCGCTGATGTCTGCGGAATCTGAAGGTGGTCTGTCTGTTTACGACAAAGTTCTGGACCCGCCGCCGGTTTGGATGAAACTGAAAGAAGAAAAAGCGCCGGGTTGGGAAGCGGCGTCTCAGATCTGGATCCAGTCTGACGAAGGTCAGTCTCTGCTGAACAAACCGGGTTCTCCGCCGCGTTGGATCCGTAAACTGCGTTCTGGTCAGCCGTGGCAGGACGACTTCGTTTCTGACCAGAAAAAAAAACAGGACGAACTGACCAAAGGTAACGCGCCGCTGATCAAACAGCTGAAAGAAATGGGTCTGCTGCCGCTGGTTAACCCGTTCTTCCGTCACCTGCTGGACCCGGAAGGTAAAGGTGTTTCTCCGTGGGACCGTCTGGCGGTTCGTGCGGCGGTTGCGCACTTCATCTCTTGGGAATCTTGGAACCACCGTACCCGTGCGGAATACAACTCTCTGAAACTGCGTCGTGACGAATTCGAAGCGGCGTCTGACGAATTCAAAGACGACTTCACCCTGCTGCGTCAGTACGAAGCGAAACGTCACTCTACCCTGAAATCTATCGCGCTGGCGGACGACTCTAACCCGTACCGTATCGGTGTTCGTTCTCTGCGTGCGTGGAACCGTGTTCGTGAAGAATGGATCGACAAAGGTGCGACCGAAGAACAGCGTGTTACCATCCTGTCTAAACTGCAGACCCAGCTGCGTGGTAAATTCGGTGACCCGGACCTGTTCAACTGGCTGGCGCAGGACCGTCACGTTCACCTGTGGTCTCCGCGTGACTCTGTTACCCCGCTGGTTCGTATCAACGCGGTTGACAAAGTTCTGCGTCGTCGTAAACCGTACGCGCTGATGACCTTCGCGCACCCGCGTTTCCACCCGCGTTGGATCCTGTACGAAGCGCCGGGTGGTTCTAACCTGCGTCAGTACGCGCTGGACTGCACCGAAAACGCGCTGCACATCACCCTGCCGCTGCTGGTTGACGACGCGCACGGTACCTGGATCGAAAAAAAAATCCGTGTTCCGCTGGCGCCGTCTGGTCAGATCCAGGACCTGACCCTGGAAAAACTGGAAAAAAAAAAAAACCGTCTGTACTACCGTTCTGGTTTCCAGCAGTTCGCGGGTCTGGCGGGTGGTGCGGAAGTTCTGTTCCACCGTCCGTACATGGAACACGACGAACGTTCTGAAGAATCTCTGCTGGAACGTCCGGGTGCGGTTTGGTTCAAACTGACCCTGGACGTTGCGACCCAGGCGCCGCCGAACTGGCTGGACGGTAAAGGTCGTGTTCGTACCCCGCCGGAAGTTCACCACTTCAAAACCGCGCTGTCTAACAAATCTAAACACACCCGTACCCTGCAGCCGGGTCTGCGTGTTCTGTCTGTTGACCTGGGTATGCGTACCTTCGCG
TCTTGCTCTGTTTTCGAACTGATCGAAGGTAAACCGGAAACCGGTCGTGCGTTCCCGGTTGCGGACGAACGTTCTATGGACTCTCCGAACAAACTGTGGGCGAAACACGAACGTTCTTTCAAACTGACCCTGCCGGGTGAAACCCCGTCTCGTAAAGAAGAAGAAGAACGTTCTATCGCGCGTGCGGAAATCTACGCGCTGAAACGTGACATCCAGCGTCTGAAATCTCTGCTGCGTCTGGGTGAAGAAGACAACGACAACCGTCGTGACGCGCTGCTGGAACAGTTCTTCAAAGGTTGGGGTGAAGAAGACGTTGTTCCGGGTCAGGCGTTCCCGCGTTCTCTGTTCCAGGGTCTGGGTGCGGCGCCGTTCCGTTCTACCCCGGAACTGTGGCGTCAGCACTGCCAGACCTACTACGACAAAGCGGAAGCGTGCCTGGCGAAACACATCTCTGACTGGCGTAAACGTACCCGTCCGCGTCCGACCTCTCGTGAAATGTGGTACAAAACCCGTTCTTACCACGGTGGTAAATCTATCTGGATGCTGGAATACCTGGACGCGGTTCGTAAACTGCTGCTGTCTTGGTCTCTGCGTGGTCGTACCTACGGTGCGATCAACCGTCAGGACACCGCGCGTTTCGGTTCTCTGGCGTCTCGTCTGCTGCACCACATCAACTCTCTGAAAGAAGACCGTATCAAAACCGGTGCGGACTCTATCGTTCAGGCGGCGCGTGGTTACATCCCGCTGCCGCACGGTAAAGGTTGGGAACAGCGTTACGAACCGTGCCAGCTGATCCTGTTCGAAGACCTGGCGCGTTACCGTTTCCGTGTTGACCGTCCGCGTCGTGAAAACTCTCAGCTGATGCAGTGGAACCACCGTGCGATCGTTGCGGAAACCACCATGCAGGCGGAACTGTACGGTCAGATCGTTGAAAACACCGCGGCGGGTTTCTCTTCTCGTTTCCACGCGGCGACCGGTGCGCCGGGTGTTCGTTGCCGTTTCCTGCTGGAACGTGACTTCGACAACGACCTGCCGAAACCGTACCTGCTGCGTGAACTGTCTTGGATGCTGGGTAACACCAAAGTTGAATCTGAAGAAGAAAAACTGCGTCTGCTGTCTGAAAAAATCCGTCCGGGTTCTCTGGTTCCGTGGGACGGTGGTGAACAGTTCGCGACCCTGCACCCGAAACGTCAGACCCTGTGCGTTATCCACGCGGACATGAACGCGGCGCAGAACCTGCAGCGTCGTTTCTTCGGTCGTTGCGGTGAAGCGTTCCGTCTGGTTTGCCAGCCGCACGGTGACGACGTTCTGCGTCTGGCGTCTACCCCGGGTGCGCGTCTGCTGGGTGCGCTGCAGCAGCTGGAAAACGGTCAGGGTGCGTTCGAACTGGTTCGTGACATGGGTTCTACCTCTCAGATGAACCGTTTCGTTATGAAATCTCTGGGTAAAAAAAAAATCAAACCGCTGCAGGACAACAACGGTGACGACGAACTGGAAGACGTTCTGTCTGTTCTGCCGGAAGAAGACGACACCGGTCGTATCACCGTTTTCCGTGACTCTTCTGGTATCTTCTTCCCGTGCAACGTTTGGATCCCGGCGAAACAGTTCTGGCCGGCGGTTCGTGCGATGATCTGGAAAGTTATGGCGTCTCACTCTCTGGGTTAAGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATGTAGGGAGACCCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACA SEQAATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGTTGCCGTCACTGCGTCIDTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGCTTATTAAAAGCATTCTGTAACAAAGCGGGANO:CCAAAGCCATGACAAAAACGCGTAACAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGATTATT 
76TGCACGGCGTCACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTGACGCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAGTAATACGACTCACTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAACTTTAAGAGGAGGATATACCATGCACCATCATCATCACCATACCAAACTGCGTCACCGTCAGAAAAAACTGACCCACGACTGGGCGGGTTCTAAAAAACGTGAAGTTCTGGGTTCTAACGGTAAACTGCAGAACCCGCTGCTGATGCCGGTTAAAAAAGGTCAGGTTACCGAATTCCGTAAAGCGTTCTCTGCGTACGCGCGTGCGACCAAAGGTGAAATGACCGACGGTCGTAAAAACATGTTCACCCACTCTTTCGAACCGTTCAAAACCAAACCGTCTCTGCACCAGTGCGAACTGGCGGACAAAGCGTACCAGTCTCTGCACTCTTACCTGCCGGGTTCTCTGGCGCACTTCCTGCTGTCTGCGCACGCGCTGGGTTTCCGTATCTTCTCTAAATCTGGTGAAGCGACCGCGTTCCAGGCGTCTTCTAAAATCGAAGCGTACGAATCTAAACTGGCGTCTGAACTGGCGTGCGTTGACCTGTCTATCCAGAACCTGACCATCTCTACCCTGTTCAACGCGCTGACCACCTCTGTTCGTGGTAAAGGTGAAGAAACCTCTGCGGACCCGCTGATCGCGCGTTTCTACACCCTGCTGACCGGTAAACCGCTGTCTCGTGACACCCAGGGTCCGGAACGTGACCTGGCGGAAGTTATCTCTCGTAAAATCGCGTCTTCTTTCGGTACCTGGAAAGAAATGACCGCGAACCCGCTGCAGTCTCTGCAGTTCTTCGAAGAAGAACTGCACGCGCTGGACGCGAACGTTTCTCTGTCTCCGGCGTTCGACGTTCTGATCAAAATGAACGACCTGCAGGGTGACCTGAAAAACCGTACCATCGTTTTCGACCCGGACGCGCCGGTTTTCGAATACAACGCGGAAGACCCGGCGGACATCATCATCAAACTGACCGCGCGTTACGCGAAAGAAGCGGTTATCAAAAACCAGAACGTTGGTAACTACGTTAAAAACGCGATCACCACCACCAACGCGAACGGTCTGGGTTGGCTGCTGAACAAAGGTCTGTCTCTGCTGCCGGTTTCTACCGACGACGAACTGCTGGAATTCATCGGTGTTGAACGTTCTCACCCGTCTTGCCACGCGCTGATCGAACTGATCGCGCAGCTGGAAGCGCCGGAACTGTTCGAAAAAAACGTTTTCTCTGACACCCGTTCTGAAGTTCAGGGTATGATCGACTCTGCGGTTTCTAACCACATCGCGCGTCTGTCTTCTTCTCGTAACTCTCTGTCTATGGACTCTGAAGAACTGGAACGTCTGATCAAATCTTTCCAGATCCACACCCCGCACTGCTCTCTGTTCATCGGTGCGCAGTCTCTGTCTCAGCAGCTGGAATCTCTGCCGGAAGCGCTGCAGTCTGGTGTTAACTCTGCGGACATCCTGCTGGGTTCTACCCAGTACATGCTGACCAACTCTCTGGTTGAAGAATCTATCGCGACCTACCAGCGTACCCTGAACCGTATCAACTACCTGTCTGGTGTTGCGGGTCAGATCAACGGTGCGATCAAACGTAAAGCGATCGACGGTGAAAAAATCCACCTGCCGGCGGCGTGGTCTGAACTGATCTCTCTGCCGTTCATCGGTCAGCCGGTTATCGACGTTGAATCTGACCTGGCGCACCTGAAAAACCAGTACCAGACCCTGTCTAACGAATTCGACACCCTGATCTCTGCGCTGCAGAAAAACTTCGACCTGAACTTCAACAAAGCGCTGCTGAACCGTACCCAGCACTTCGAAGCGATGTGCCGTTCTACCAAAAAAAACGCGCTGTCTAAACCGGAAATCGTTTCTTACCGTGACCTGCTG
GCGCGTCTGACCTCTTGCCTGTACCGTGGTTCTCTGGTTCTGCGTCGTGCGGGTATCGAAGTTCTGAAAAAACACAAAATCTTCGAATCTAACTCTGAACTGCGTGAACACGTTCACGAACGTAAACACTTCGTTTTCGTTTCTCCGCTGGACCGTAAAGCGAAAAAACTGCTGCGTCTGACCGACTCTCGTCCGGACCTGCTGCACGTTATCGACGAAATCCTGCAGCACGACAACCTGGAAAACAAAGACCGTGAATCTCTGTGGCTGGTTCGTTCTGGTTACCTGCTGGCGGGTCTGCCGGACCAGCTGTCTTCTTCTTTCATCAACCTGCCGATCATCACCCAGAAAGGTGACCGTCGTCTGATCGACCTGATCCAGTACGACCAGATCAACCGTGACGCGTTCGTTATGCTGGTTACCTCTGCGTTCAAATCTAACCTGTCTGGTCTGCAGTACCGTGCGAACAAACAGTCTTTCGTTGTTACCCGTACCCTGTCTCCGTACCTGGGTTCTAAACTGGTTTACGTTCCGAAAGACAAAGACTGGCTGGTTCCGTCTCAGATGTTCGAAGGTCGTTTCGCGGACATCCTGCAGTCTGACTACATGGTTTGGAAAGACGCGGGTCGTCTGTGCGTTATCGACACCGCGAAACACCTGTCTAACATCAAAAAATCTGTTTTCTCTTCTGAAGAAGTTCTGGCGTTCCTGCGTGAACTGCCGCACCGTACCTTCATCCAGACCGAAGTTCGTGGTCTGGGTGTTAACGTTGACGGTATCGCGTTCAACAACGGTGACATCCCGTCTCTGAAAACCTTCTCTAACTGCGTTCAGGTTAAAGTTTCTCGTACCAACACCTCTCTGGTTCAGACCCTGAACCGTTGGTTCGAAGGTGGTAAAGTTTCTCCGCCGTCTATCCAGTTCGAACGTGCGTACTACAAAAAAGACGACCAGATCCACGAAGACGCGGCGAAACGTAAAATCCGTTTCCAGATGCCGGCGACCGAACTGGTTCACGCGTCTGACGACGCGGGTTGGACCCCGTCTTACCTGCTGGGTATCGACCCGGGTGAATACGGTATGGGTCTGTCTCTGGTTTCTATCAACAACGGTGAAGTTCTGGACTCTGGTTTCATCCACATCAACTCTCTGATCAACTTCGCGTCTAAAAAATCTAACCACCAGACCAAAGTTGTTCCGCGTCAGCAGTACAAATCTCCGTACGCGAACTACCTGGAACAGTCTAAAGACTCTGCGGCGGGTGACATCGCGCACATCCTGGACCGTCTGATCTACAAACTGAACGCGCTGCCGGTTTTCGAAGCGCTGTCTGGTAACTCTCAGTCTGCGGCGGACCAGGTTTGGACCAAAGTTCTGTCTTTCTACACCTGGGGTGACAACGACGCGCAGAACTCTATCCGTAAACAGCACTGGTTCGGTGCGTCTCACTGGGACATCAAAGGTATGCTGCGTCAGCCGCCGACCGAAAAAAAACCGAAACCGTACATCGCGTTCCCGGGTTCTCAGGTTTCTTCTTACGGTAACTCTCAGCGTTGCTCTTGCTGCGGTCGTAACCCGATCGAACAGCTGCGTGAAATGGCGAAAGACACCTCTATCAAAGAACTGAAAATCCGTAACTCTGAAATCCAGCTGTTCGACGGTACCATCAAACTGTTCAACCCGGACCCGTCTACCGTTATCGAACGTCGTCGTCACAACCTGGGTCCGTCTCGTATCCCGGTTGCGGACCGTACCTTCAAAAACATCTCTCCGTCTTCTCTGGAATTCAAAGAACTGATCACCATCGTTTCTCGTTCTATCCGTCACTCTCCGGAATTCATCGCGAAAAAACGTGGTATCGGTTCTGAATACTTCTGCGCGTACTCTGACTGCAACTCTTCTCTGAACTCTGAAGCGAACGCGGCGGCGAACGTTGCGCAGAAATTCCAGAAACAGCTGTTCTTCGAACTGTAAGAAATCATCCTTAGCGAAAGCTAAGG
ATTTTTTTTATCTGAAATGTAGGGAGACCCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACA SEQAATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGTTGCCGTCACTGCGTCIDTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGCTTATTAAAAGCATTCTGTAACAAAGCGGGANO:CCAAAGCCATGACAAAAACGCGTAACAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGATTATT 77TGCACGGCGTCACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTGACGCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAGTAATACGACTCACTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAACTTTAAGAGGAGGATATACCATGCACCATCATCATCACCATAAACGTATCCTGAACTCTCTGAAAGTTGCGGCGCTGCGTCTGCTGTTCCGTGGTAAAGGTTCTGAACTGGTTAAAACCGTTAAATACCCGCTGGTTTCTCCGGTTCAGGGTGCGGTTGAAGAACTGGCGGAAGCGATCCGTCACGACAACCTGCACCTGTTCGGTCAGAAAGAAATCGTTGACCTGATGGAAAAAGACGAAGGTACCCAGGTTTACTCTGTTGTTGACTTCTGGCTGGACACCCTGCGTCTGGGTATGTTCTTCTCTCCGTCTGCGAACGCGCTGAAAATCACCCTGGGTAAATTCAACTCTGACCAGGTTTCTCCGTTCCGTAAAGTTCTGGAACAGTCTCCGTTCTTCCTGGCGGGTCGTCTGAAAGTTGAACCGGCGGAACGTATCCTGTCTGTTGAAATCCGTAAAATCGGTAAACGTGAAAACCGTGTTGAAAACTACGCGGCGGACGTTGAAACCTGCTTCATCGGTCAGCTGTCTTCTGACGAAAAACAGTCTATCCAGAAACTGGCGAACGACATCTGGGACTCTAAAGACCACGAAGAACAGCGTATGCTGAAAGCGGACTTCTTCGCGATCCCGCTGATCAAAGACCCGAAAGCGGTTACCGAAGAAGACCCGGAAAACGAAACCGCGGGTAAACAGAAACCGCTGGAACTGTGCGTTTGCCTGGTTCCGGAACTGTACACCCGTGGTTTCGGTTCTATCGCGGACTTCCTGGTTCAGCGTCTGACCCTGCTGCGTGACAAAATGTCTACCGACACCGCGGAAGACTGCCTGGAATACGTTGGTATCGAAGAAGAAAAAGGTAACGGTATGAACTCTCTGCTGGGTACCTTCCTGAAAAACCTGCAGGGTGACGGTTTCGAACAGATCTTCCAGTTCATGCTGGGTTCTTACGTTGGTTGGCAGGGTAAAGAAGACGTTCTGCGTGAACGTCTGGACCTGCTGGCGGAAAAAGTTAAACGTCTGCCGAAACCGAAATTCGCGGGTGAATGGTCTGGTCACCGTATGTTCCTGCACGGTCAGCTGAAATCTTGGTCTTCTAACTTCTTCCGTCTGTTCAACGAAACCCGTGAACTGCTGGAATCTATCAAATCTGACATCCAGCACGCGACCATGCTGATCTCTTACGTTGAAGAAAAAGGTGGTTACCACCCGCAGCTGCTGTCTCAGTACCGTAAACTGATGGAACAGCTGCCGGCGCTGCGTACCAAAGTTCTGGACCCGGAAATCGAAATGACCCACATGTCTGAAGCGGTTCGTTCTTACATCATGATCCACAAATCTGTTGCGGGTTTCCTGCCGGACCTGCTGGAATCTCTGGACCGTGACAAAGACCGTGAATTCCTGCTGTCTATCTTCCCGCGTATCCCGAAAATCGACAAAAAAACCAAAGAAATCGTTGCGTGGGAACTGCCGGGTGAACCGGAAGAAGGTTACCTGTTCACCGCGAACAACCTGTTC
CGTAACTTCCTGGAAAACCCGAAACACGTTCCGCGTTTCATGGCGGAACGTATCCCGGAAGACTGGACCCGTCTGCGTTCTGCGCCGGTTTGGTTCGACGGTATGGTTAAACAGTGGCAGAAAGTTGTTAACCAGCTGGTTGAATCTCCGGGTGCGCTGTACCAGTTCAACGAATCTTTCCTGCGTCAGCGTCTGCAGGCGATGCTGACCGTTTACAAACGTGACCTGCAGACCGAAAAATTCCTGAAACTGCTGGCGGACGTTTGCCGTCCGCTGGTTGACTTCTTCGGTCTGGGTGGTAACGACATCATCTTCAAATCTTGCCAGGACCCGCGTAAACAGTGGCAGACCGTTATCCCGCTGTCTGTTCCGGCGGACGTTTACACCGCGTGCGAAGGTCTGGCGATCCGTCTGCGTGAAACCCTGGGTTTCGAATGGAAAAACCTGAAAGGTCACGAACGTGAAGACTTCCTGCGTCTGCACCAGCTGCTGGGTAACCTGCTGTTCTGGATCCGTGACGCGAAACTGGTTGTTAAACTGGAAGACTGGATGAACAACCCGTGCGTTCAGGAATACGTTGAAGCGCGTAAAGCGATCGACCTGCCGCTGGAAATCTTCGGTTTCGAAGTTCCGATCTTCCTGAACGGTTACCTGTTCTCTGAACTGCGTCAGCTGGAACTGCTGCTGCGTCGTAAATCTGTTATGACCTCTTACTCTGTTAAAACCACCGGTTCTCCGAACCGTCTGTTCCAGCTGGTTTACCTGCCGCTGAACCCGTCTGACCCGGAAAAAAAAAACTCTAACAACTTCCAGGAACGTCTGGACACCCCGACCGGTCTGTCTCGTCGTTTCCTGGACCTGACCCTGGACGCGTTCGCGGGTAAACTGCTGACCGACCCGGTTACCCAGGAACTGAAAACCATGGCGGGTTTCTACGACCACCTGTTCGGTTTCAAACTGCCGTGCAAACTGGCGGCGATGTCTAACCACCCGGGTTCTTCTTCTAAAATGGTTGTTCTGGCGAAACCGAAAAAAGGTGTTGCGTCTAACATCGGTTTCGAACCGATCCCGGACCCGGCGCACCCGGTTTTCCGTGTTCGTTCTTCTTGGCCGGAACTGAAATACCTGGAAGGTCTGCTGTACCTGCCGGAAGACACCCCGCTGACCATCGAACTGGCGGAAACCTCTGTTTCTTGCCAGTCTGTTTCTTCTGTTGCGTTCGACCTGAAAAACCTGACCACCATCCTGGGTCGTGTTGGTGAATTCCGTGTTACCGCGGACCAGCCGTTCAAACTGACCCCGATCATCCCGGAAAAAGAAGAATCTTTCATCGGTAAAACCTACCTGGGTCTGGACGCGGGTGAACGTTCTGGTGTTGGTTTCGCGATCGTTACCGTTGACGGTGACGGTTACGAAGTTCAGCGTCTGGGTGTTCACGAAGACACCCAGCTGATGGCGCTGCAGCAGGTTGCGTCTAAATCTCTGAAAGAACCGGTTTTCCAGCCGCTGCGTAAAGGTACCTTCCGTCAGCAGGAACGTATCCGTAAATCTCTGCGTGGTTGCTACTGGAACTTCTACCACGCGCTGATGATCAAATACCGTGCGAAAGTTGTTCACGAAGAATCTGTTGGTTCTTCTGGTCTGGTTGGTCAGTGGCTGCGTGCGTTCCAGAAAGACCTGAAAAAAGCGGACGTTCTGCCGAAAAAAGGTGGTAAAAACGGTGTTGACAAAAAAAAACGTGAATCTTCTGCGCAGGACACCCTGTGGGGTGGTGCGTTCTCTAAAAAAGAAGAACAGCAGATCGCGTTCGAAGTTCAGGCGGCGGGTTCTTCTCAGTTCTGCCTGAAATGCGGTTGGTGGTTCCAGCTGGGTATGCGTGAAGTTAACCGTGTTCAGGAATCTGGTGTTGTTCTGGACTGGAACCGTTCTATCGTTACCTTCCTGATCGAATCTTCTGGTGAAAAAGTTTACGGTTTCTCTCCGCAGCAGCTGGAAAAAGG
TTTCCGTCCGGACATCGAAACCTTCAAAAAAATGGTTCGTGACTTCATGCGTCCGCCGATGTTCGACCGTAAAGGTCGTCCGGCGGCGGCGTACGAACGTTTCGTTCTGGGTCGTCGTCACCGTCGTTACCGTTTCGACAAAGTTTTCGAAGAACGTTTCGGTCGTTCTGCGCTGTTCATCTGCCCGCGTGTTGGTTGCGGTAACTTCGACCACTCTTCTGAACAGTCTGCGGTTGTTCTGGCGCTGATCGGTTACATCGCGGACAAAGAAGGTATGTCTGGTAAAAAACTGGTTTACGTTCGTCTGGCGGAACTGATGGCGGAATGGAAACTGAAAAAACTGGAACGTTCTCGTGTTGAAGAACAGTCTTCTGCGCAGTAAGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATGTAGGGAGACCCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACA SEQAATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGTTGCCGTCACTGCGTCIDTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGCTTATTAAAAGCATTCTGTAACAAAGCGGGANO:CCAAAGCCATGACAAAAACGCGTAACAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGATTATT 78TGCACGGCGTCACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTGACGCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAGTAATACGACTCACTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAACTTTAAGAGGAGGATATACCATGCACCATCATCATCACCATGCGGAATCTAAACAGATGCAGTGCCGTAAATGCGGTGCGTCTATGAAATACGAAGTTATCGGTCTGGGTAAAAAATCTTGCCGTTACATGTGCCCGGACTGCGGTAACCACACCTCTGCGCGTAAAATCCAGAACAAAAAAAAACGTGACAAAAAATACGGTTCTGCGTCTAAAGCGCAGTCTCAGCGTATCGCGGTTGCGGGTGCGCTGTACCCGGACAAAAAAGTTCAGACCATCAAAACCTACAAATACCCGGCGGACCTGAACGGTGAAGTTCACGACTCTGGTGTTGCGGAAAAAATCGCGCAGGCGATCCAGGAAGACGAAATCGGTCTGCTGGGTCCGTCTTCTGAATACGCGTGCTGGATCGCGTCTCAGAAACAGTCTGAACCGTACTCTGTTGTTGACTTCTGGTTCGACGCGGTTTGCGCGGGTGGTGTTTTCGCGTACTCTGGTGCGCGTCTGCTGTCTACCGTTCTGCAGCTGTCTGGTGAAGAATCTGTTCTGCGTGCGGCGCTGGCGTCTTCTCCGTTCGTTGACGACATCAACCTGGCGCAGGCGGAAAAATTCCTGGCGGTTTCTCGTCGTACCGGTCAGGACAAACTGGGTAAACGTATCGGTGAATGCTTCGCGGAAGGTCGTCTGGAAGCGCTGGGTATCAAAGACCGTATGCGTGAATTCGTTCAGGCGATCGACGTTGCGCAGACCGCGGGTCAGCGTTTCGCGGCGAAACTGAAAATCTTCGGTATCTCTCAGATGCCGGAAGCGAAACAGTGGAACAACGACTCTGGTCTGACCGTTTGCATCCTGCCGGACTACTACGTTCCGGAAGAAAACCGTGCGGACCAGCTGGTTGTTCTGCTGCGTCGTCTGCGTGAAATCGCGTACTGCATGGGTATCGAAGACGAAGCGGGTTTCGAACACCTGGGTATCGACCCGGGTGCGCTGTCTAACTTCTCTAACGGTAACCCGAAACGTGGTTTCCTGGGTCGTCTGCTGAACAACGACATCATCGCGCTGGCGAACAACATGTCTGCGATGACCCCGTACTGGGAAGGTCGTAAAGGTGAA
CTGATCGAACGTCTGGCGTGGCTGAAACACCGTGCGGAAGGTCTGTACCTGAAAGAACCGCACTTCGGTAACTCTTGGGCGGACCACCGTTCTCGTATCTTCTCTCGTATCGCGGGTTGGCTGTCTGGTTGCGCGGGTAAACTGAAAATCGCGAAAGACCAGATCTCTGGTGTTCGTACCGACCTGTTCCTGCTGAAACGTCTGCTGGACGCGGTTCCGCAGTCTGCGCCGTCTCCGGACTTCATCGCGTCTATCTCTGCGCTGGACCGTTTCCTGGAAGCGGCGGAATCTTCTCAGGACCCGGCGGAACAGGTTCGTGCGCTGTACGCGTTCCACCTGAACGCGCCGGCGGTTCGTTCTATCGCGAACAAAGCGGTTCAGCGTTCTGACTCTCAGGAATGGCTGATCAAAGAACTGGACGCGGTTGACCACCTGGAATTCAACAAAGCGTTCCCGTTCTTCTCTGACACCGGTAAAAAAAAAAAAAAAGGTGCGAACTCTAACGGTGCGCCGTCTGAAGAAGAATACACCGAAACCGAATCTATCCAGCAGCCGGAAGACGCGGAACAGGAAGTTAACGGTCAGGAAGGTAACGGTGCGTCTAAAAACCAGAAAAAATTCCAGCGTATCCCGCGTTTCTTCGGTGAAGGTTCTCGTTCTGAATACCGTATCCTGACCGAAGCGCCGCAGTACTTCGACATGTTCTGCAACAACATGCGTGCGATCTTCATGCAGCTGGAATCTCAGCCGCGTAAAGCGCCGCGTGACTTCAAATGCTTCCTGCAGAACCGTCTGCAGAAACTGTACAAACAGACCTTCCTGAACGCGCGTTCTAACAAATGCCGTGCGCTGCTGGAATCTGTTCTGATCTCTTGGGGTGAATTCTACACCTACGGTGCGAACGAAAAAAAATTCCGTCTGCGTCACGAAGCGTCTGAACGTTCTTCTGACCCGGACTACGTTGTTCAGCAGGCGCTGGAAATCGCGCGTCGTCTGTTCCTGTTCGGTTTCGAATGGCGTGACTGCTCTGCGGGTGAACGTGTTGACCTGGTTGAAATCCACAAAAAAGCGATCTCTTTCCTGCTGGCGATCACCCAGGCGGAAGTTTCTGTTGGTTCTTACAACTGGCTGGGTAACTCTACCGTTTCTCGTTACCTGTCTGTTGCGGGTACCGACACCCTGTACGGTACCCAGCTGGAAGAATTCCTGAACGCGACCGTTCTGTCTCAGATGCGTGGTCTGGCGATCCGTCTGTCTTCTCAGGAACTGAAAGACGGTTTCGACGTTCAGCTGGAATCTTCTTGCCAGGACAACCTGCAGCACCTGCTGGTTTACCGTGCGTCTCGTGACCTGGCGGCGTGCAAACGTGCGACCTGCCCGGCGGAACTGGACCCGAAAATCCTGGTTCTGCCGGTTGGTGCGTTCATCGCGTCTGTTATGAAAATGATCGAACGTGGTGACGAACCGCTGGCGGGTGCGTACCTGCGTCACCGTCCGCACTCTTTCGGTTGGCAGATCCGTGTTCGTGGTGTTGCGGAAGTTGGTATGGACCAGGGTACCGCGCTGGCGTTCCAGAAACCGACCGAATCTGAACCGTTCAAAATCAAACCGTTCTCTGCGCAGTACGGTCCGGTTCTGTGGCTGAACTCTTCTTCTTACTCTCAGTCTCAGTACCTGGACGGTTTCCTGTCTCAGCCGAAAAACTGGTCTATGCGTGTTCTGCCGCAGGCGGGTTCTGTTCGTGTTGAACAGCGTGTTGCGCTGATCTGGAACCTGCAGGCGGGTAAAATGCGTCTGGAACGTTCTGGTGCGCGTGCGTTCTTCATGCCGGTTCCGTTCTCTTTCCGTCCGTCTGGTTCTGGTGACGAAGCGGTTCTGGCGCCGAACCGTTACCTGGGTCTGTTCCCGCACTCTGGTGGTATCGAATACGCGGTTGTTGACGTTCTGGACTCTGCGGGTTTCAAAATCCTGGAACGTGGTACCATCGCGGTTAACGGTTT
CTCTCAGAAACGTGGTGAACGTCAGGAAGAAGCGCACCGTGAAAAACAGCGTCGTGGTATCTCTGACATCGGTCGTAAAAAACCGGTTCAGGCGGAAGTTGACGCGGCGAACGAACTGCACCGTAAATACACCGACGTTGCGACCCGTCTGGGTTGCCGTATCGTTGTTCAGTGGGCGCCGCAGCCGAAACCGGGTACCGCGCCGACCGCGCAGACCGTTTACGCGCGTGCGGTTCGTACCGAAGCGCCGCGTTCTGGTAACCAGGAAGACCACGCGCGTATGAAATCTTCTTGGGGTTACACCTGGGGTACCTACTGGGAAAAACGTAAACCGGAAGACATCCTGGGTATCTCTACCCAGGTTTACTGGACCGGTGGTATCGGTGAATCTTGCCCGGCGGTTGCGGTTGCGCTGCTGGGTCACATCCGTGCGACCTCTACCCAGACCGAATGGGAAAAAGAAGAAGTTGTTTTCGGTCGTCTGAAAAAATTCTTCCCGTCTTAAGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATGTAGGGAGACCCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACA SEQAATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGTTGCCGTCACTGCGTCIDTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGCTTATTAAAAGCATTCTGTAACAAAGCGGGANO:CCAAAGCCATGACAAAAACGCGTAACAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGATTATT 79TGCACGGCGTCACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTGACGCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAGTAATACGACTCACTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAACTTTAAGAGGAGGATATACCATGCACCATCATCATCACCATGAAAAACGTATCAACAAAATCCGTAAAAAACTGTCTGCGGACAACGCGACCAAACCGGTTTCTCGTTCTGGTCCGATGAAAACCCTGCTGGTTCGTGTTATGACCGACGACCTGAAAAAACGTCTGGAAAAACGTCGTAAAAAACCGGAAGTTATGCCGCAGGTTATCTCTAACAACGCGGCGAACAACCTGCGTATGCTGCTGGACGACTACACCAAAATGAAAGAAGCGATCCTGCAGGTTTACTGGCAGGAATTCAAAGACGACCACGTTGGTCTGATGTGCAAATTCGCGCAGCCGGCGTCTAAAAAAATCGACCAGAACAAACTGAAACCGGAAATGGACGAAAAAGGTAACCTGACCACCGCGGGTTTCGCGTGCTCTCAGTGCGGTCAGCCGCTGTTCGTTTACAAACTGGAACAGGTTTCTGAAAAAGGTAAAGCGTACACCAACTACTTCGGTCGTTGCAACGTTGCGGAACACGAAAAACTGATCCTGCTGGCGCAGCTGAAACCGGAAAAAGACTCTGACGAAGCGGTTACCTACTCTCTGGGTAAATTCGGTCAGCGTGCGCTGGACTTCTACTCTATCCACGTTACCAAAGAATCTACCCACCCGGTTAAACCGCTGGCGCAGATCGCGGGTAACCGTTACGCGTCTGGTCCGGTTGGTAAAGCGCTGTCTGACGCGTGCATGGGTACCATCGCGTCTTTCCTGTCTAAATACCAGGACATCATCATCGAACACCAGAAAGTTGTTAAAGGTAACCAGAAACGTCTGGAATCTCTGCGTGAACTGGCGGGTAAAGAAAACCTGGAATACCCGTCTGTTACCCTGCCGCCGCAGCCGCACACCAAAGAAGGTGTTGACGCGTACAACGAAGTTATCGCGCGTGTTCGTATGTGGGTTAACCTGAACCTGTGGCAGAAACTGAAACTGTCTCGTGAC
GACGCGAAACCGCTGCTGCGTCTGAAAGGTTTCCCGTCTTTCCCGGTTGTTGAACGTCGTGAAAACGAAGTTGACTGGTGGAACACCATCAACGAAGTTAAAAAACTGATCGACGCGAAACGTGACATGGGTCGTGTTTTCTGGTCTGGTGTTACCGCGGAAAAACGTAACACCATCCTGGAAGGTTACAACTACCTGCCGAACGAAAACGACCACAAAAAACGTGAAGGTTCTCTGGAAAACCCGAAAAAACCGGCGAAACGTCAGTTCGGTGACCTGCTGCTGTACCTGGAAAAAAAATACGCGGGTGACTGGGGTAAAGTTTTCGACGAAGCGTGGGAACGTATCGACAAAAAAATCGCGGGTCTGACCTCTCACATCGAACGTGAAGAAGCGCGTAACGCGGAAGACGCGCAGTCTAAAGCGGTTCTGACCGACTGGCTGCGTGCGAAAGCGTCTTTCGTTCTGGAACGTCTGAAAGAAATGGACGAAAAAGAATTCTACGCGTGCGAAATCCAGCTGCAGAAATGGTACGGTGACCTGCGTGGTAACCCGTTCGCGGTTGAAGCGGAAAACCGTGTTGTTGACATCTCTGGTTTCTCTATCGGTTCTGACGGTCACTCTATCCAGTACCGTAACCTGCTGGCGTGGAAATACCTGGAAAACGGTAAACGTGAATTCTACCTGCTGATGAACTACGGTAAAAAAGGTCGTATCCGTTTCACCGACGGTACCGACATCAAAAAATCTGGTAAATGGCAGGGTCTGCTGTACGGTGGTGGTAAAGCGAAAGTTATCGACCTGACCTTCGACCCGGACGACGAACAGCTGATCATCCTGCCGCTGGCGTTCGGTACCCGTCAGGGTCGTGAATTCATCTGGAACGACCTGCTGTCTCTGGAAACCGGTCTGATCAAACTGGCGAACGGTCGTGTTATCGAAAAAACCATCTACAACAAAAAAATCGGTCGTGACGAACCGGCGCTGTTCGTTGCGCTGACCTTCGAACGTCGTGAAGTTGTTGACCCGTCTAACATCAAACCGGTTAACCTGATCGGTGTTGACCGTGGTGAAAACATCCCGGCGGTTATCGCGCTGACCGACCCGGAAGGTTGCCCGCTGCCGGAATTCAAAGACTCTTCTGGTGGTCCGACCGACATCCTGCGTATCGGTGAAGGTTACAAAGAAAAACAGCGTGCGATCCAGGCGGCGAAAGAAGTTGAACAGCGTCGTGCGGGTGGTTACTCTCGTAAATTCGCGTCTAAATCTCGTAACCTGGCGGACGACATGGTTCGTAACTCTGCGCGTGACCTGTTCTACCACGCGGTTACCCACGACGCGGTTCTGGTTTTCGAAAACCTGTCTCGTGGTTTCGGTCGTCAGGGTAAACGTACCTTCATGACCGAACGTCAGTACACCAAAATGGAAGACTGGCTGACCGCGAAACTGGCGTACGAAGGTCTGACCTCTAAAACCTACCTGTCTAAAACCCTGGCGCAGTACACCTCTAAAACCTGCTCTAACTGCGGTTTCACCATCACCACCGCGGACTACGACGGTATGCTGGTTCGTCTGAAAAAAACCTCTGACGGTTGGGCGACCACCCTGAACAACAAAGAACTGAAAGCGGAAGGTCAGATCACCTACTACAACCGTTACAAACGTCAGACCGTTGAAAAAGAACTGTCTGCGGAACTGGACCGTCTGTCTGAAGAATCTGGTAACAACGACATCTCTAAATGGACCAAAGGTCGTCGTGACGAAGCGCTGTTCCTGCTGAAAAAACGTTTCTCTCACCGTCCGGTTCAGGAACAGTTCGTTTGCCTGGACTGCGGTCACGAAGTTCACGCGGACGAACAGGCGGCGCTGAACATCGCGCGTTCTTGGCTGTTCCTGAACTCTAACTCTACCGAATTCAAATCTTACAAATCTGGTAAACAGCCGTTCGTTGGTGCGTGGCAGGCGTTCTACAAACGTCGTCTGAAAGAAGTTTGGAAACC
GAACGCGTAAGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATGTAGGGAGACCCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACA SEQAATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGTTGCCGTCACTGCGTCIDTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGCTTATTAAAAGCATTCTGTAACAAAGCGGGANO:CCAAAGCCATGACAAAAACGCGTAACAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGATTATT 80TGCACGGCGTCACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTGACGCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAGTAATACGACTCACTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAACTTTAAGAGGAGGATATACCATGCACCATCATCATCACCATAAACGTATCAACAAAATCCGTCGTCGTCTGGTTAAAGACTCTAACACCAAAAAAGCGGGTAAAACCGGTCCGATGAAAACCCTGCTGGTTCGTGTTATGACCCCGGACCTGCGTGAACGTCTGGAAAACCTGCGTAAAAAACCGGAAAACATCCCGCAGCCGATCTCTAACACCTCTCGTGCGAACCTGAACAAACTGCTGACCGACTACACCGAAATGAAAAAAGCGATCCTGCACGTTTACTGGGAAGAATTCCAGAAAGACCCGGTTGGTCTGATGTCTCGTGTTGCGCAGCCGGCGCCGAAAAACATCGACCAGCGTAAACTGATCCCGGTTAAAGACGGTAACGAACGTCTGACCTCTTCTGGTTTCGCGTGCTCTCAGTGCTGCCAGCCGCTGTACGTTTACAAACTGGAACAGGTTAACGACAAAGGTAAACCGCACACCAACTACTTCGGTCGTTGCAACGTTTCTGAACACGAACGTCTGATCCTGCTGTCTCCGCACAAACCGGAAGCGAACGACGAACTGGTTACCTACTCTCTGGGTAAATTCGGTCAGCGTGCGCTGGACTTCTACTCTATCCACGTTACCCGTGAATCTAACCACCCGGTTAAACCGCTGGAACAGATCGGTGGTAACTCTTGCGCGTCTGGTCCGGTTGGTAAAGCGCTGTCTGACGCGTGCATGGGTGCGGTTGCGTCTTTCCTGACCAAATACCAGGACATCATCCTGGAACACCAGAAAGTTATCAAAAAAAACGAAAAACGTCTGGCGAACCTGAAAGACATCGCGTCTGCGAACGGTCTGGCGTTCCCGAAAATCACCCTGCCGCCGCAGCCGCACACCAAAGAAGGTATCGAAGCGTACAACAACGTTGTTGCGCAGATCGTTATCTGGGTTAACCTGAACCTGTGGCAGAAACTGAAAATCGGTCGTGACGAAGCGAAACCGCTGCAGCGTCTGAAAGGTTTCCCGTCTTTCCCGCTGGTTGAACGTCAGGCGAACGAAGTTGACTGGTGGGACATGGTTTGCAACGTTAAAAAACTGATCAACGAAAAAAAAGAAGACGGTAAAGTTTTCTGGCAGAACCTGGCGGGTTACAAACGTCAGGAAGCGCTGCTGCCGTACCTGTCTTCTGAAGAAGACCGTAAAAAAGGTAAAAAATTCGCGCGTTACCAGTTCGGTGACCTGCTGCTGCACCTGGAAAAAAAACACGGTGAAGACTGGGGTAAAGTTTACGACGAAGCGTGGGAACGTATCGACAAAAAAGTTGAAGGTCTGTCTAAACACATCAAACTGGAAGAAGAACGTCGTTCTGAAGACGCGCAGTCTAAAGCGGCGCTGACCGACTGGCTGCGTGCGAAAGCGTCTTTCGTTATCGAAGGTCTGAAAGAAGCGGACAAAGACGAATTCTGCCGTTGCGAACTGAAACTGCAGAAA
TGGTACGGTGACCTGCGTGGTAAACCGTTCGCGATCGAAGCGGAAAACTCTATCCTGGACATCTCTGGTTTCTCTAAACAGTACAACTGCGCGTTCATCTGGCAGAAAGACGGTGTTAAAAAACTGAACCTGTACCTGATCATCAACTACTTCAAAGGTGGTAAACTGCGTTTCAAAAAAATCAAACCGGAAGCGTTCGAAGCGAACCGTTTCTACACCGTTATCAACAAAAAATCTGGTGAAATCGTTCCGATGGAAGTTAACTTCAACTTCGACGACCCGAACCTGATCATCCTGCCGCTGGCGTTCGGTAAACGTCAGGGTCGTGAATTCATCTGGAACGACCTGCTGTCTCTGGAAACCGGTTCTCTGAAACTGGCGAACGGTCGTGTTATCGAAAAAACCCTGTACAACCGTCGTACCCGTCAGGACGAACCGGCGCTGTTCGTTGCGCTGACCTTCGAACGTCGTGAAGTTCTGGACTCTTCTAACATCAAACCGATGAACCTGATCGGTATCGACCGTGGTGAAAACATCCCGGCGGTTATCGCGCTGACCGACCCGGAAGGTTGCCCGCTGTCTCGTTTCAAAGACTCTCTGGGTAACCCGACCCACATCCTGCGTATCGGTGAATCTTACAAAGAAAAACAGCGTACCATCCAGGCGGCGAAAGAAGTTGAACAGCGTCGTGCGGGTGGTTACTCTCGTAAATACGCGTCTAAAGCGAAAAACCTGGCGGACGACATGGTTCGTAACACCGCGCGTGACCTGCTGTACTACGCGGTTACCCAGGACGCGATGCTGATCTTCGAAAACCTGTCTCGTGGTTTCGGTCGTCAGGGTAAACGTACCTTCATGGCGGAACGTCAGTACACCCGTATGGAAGACTGGCTGACCGCGAAACTGGCGTACGAAGGTCTGCCGTCTAAAACCTACCTGTCTAAAACCCTGGCGCAGTACACCTCTAAAACCTGCTCTAACTGCGGTTTCACCATCACCTCTGCGGACTACGACCGTGTTCTGGAAAAACTGAAAAAAACCGCGACCGGTTGGATGACCACCATCAACGGTAAAGAACTGAAAGTTGAAGGTCAGATCACCTACTACAACCGTTACAAACGTCAGAACGTTGTTAAAGACCTGTCTGTTGAACTGGACCGTCTGTCTGAAGAATCTGTTAACAACGACATCTCTTCTTGGACCAAAGGTCGTTCTGGTGAAGCGCTGTCTCTGCTGAAAAAACGTTTCTCTCACCGTCCGGTTCAGGAAAAATTCGTTTGCCTGAACTGCGGTTTCGAAACCCACGCGGACGAACAGGCGGCGCTGAACATCGCGCGTTCTTGGCTGTTCCTGCGTTCTCAGGAATACAAAAAATACCAGACCAACAAAACCACCGGTAACACCGACAAACGTGCGTTCGTTGAAACCTGGCAGTCTTTCTACCGTAAAAAACTGAAAGAAGTTTGGAAACCGGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATGTAGGGAGACCCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACA SEQtgccgtcactgcgtcttttactggctcttctcgctaaccaaaccggtaaccccgcttattaaaagcatt IDctgtaacaaagcgggaccaaagccatgacaaaaacgcgtaacaaaagtgtctataatcacggcagaaaaNO:gtccacattgattatttgcacggcgtcacactttgctatgccatagcatttttatccataagattagcg 81gatcctacctgacgctttttatcgcaactctctactgtttctccatacccgtttttttgggctagcaccgcctatctcgtgtgagataggcggagatacgaactttaagAAGGAGatatacc 
SEQ ID NO: 82 TGCCGTCACTGCGTCTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGCTTATTAAAAGCATTCTGTAACAAAGCGGGACCAAAGCCATGACAAAAACGCGTAACAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGATTATTTGCACGGCGTCACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTGACGCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGTAGCGGATCCTACCTGAC
SEQ ID NO: 83 AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGTTTCTAGAGCACAGCTAACACCACGTCGTCCCTATCTGCTGCCCTAGGTCTATGAGTGGTTGCTGGATAACTTTACGGGCATGCATAAGGCTCGTAATATATATTCAGGGAGACCACAACGGTTTCCCTCTACAAATAATTTTGTTTAACTTTTACTAGAGCTAGCAGTAATACGACTCACTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAACTTTAAGAGGAGGATATACCA
SEQ ID NO: 84 GTTTGAGAGATATGTAAATTCAAAGGATAATCAAAC
SEQ ID NO: 85 actacattttttaagacctaattttgagt
SEQ ID NO: 86 ctcaaaactcattcgaatctctactctttgtagat
SEQ ID NO: 87 CTCTAGCAGGCCTGGCAAATTTCTACTGTTGTAGAT
SEQ ID NO: 88 CCGTCTAAAACTCATTCAGAATTTCTACTAGTGTAGAT
SEQ ID NO: 89 GTCTAGGTACTCTCTTTAATTTCTACTATTGT
SEQ ID NO: 90 gttaagttatatagaataatttctactgttgtaga
SEQ ID NO: 91 gtttaaaaccactttaaaatttctactattgta
SEQ ID NO: 92 GTTTGAGAATGATGTAAAAATGTATGGTACACAGAAATGTTTTAATACCATATTTTTACATCACTCTCAAACATACATCTCTTGTTACTGTTTATCGTATCCAGATTAAATTTCACGTTTTT
SEQ ID NO: 93 CTCTACAACTGATAAAGAATTTCTACTTTTGTAGAT
SEQ ID NO: 94 GTCTGGCCCCAAATTTTAATTTCTACTGTTGTAGAT
SEQ ID NO: 95 GTCAAAAGACCTTTTTAATTTCTACTCTTGTAGAT
SEQ ID NO: 96 GTCTAGAGGACAGAATTTTTCAACGGGTGTGCCAATGGCCACTTTCCAGGTGGCAAAGCCCGTTGAGCTTCTACGGAAGTGGCAC
SEQ ID NO: 97 CGAGGTTCTGTCTTTTGGTCAGGACAACCGTCTAGCTATAAGTGCTGCAGGGGTGTGAGAAACTCCTATTGCTGGACGATGTCTCTTTTAACGAGGCATTAGCAC
SEQ ID NO: 98 GAACGAGGGACGTTTTGTCTCCAATGATTTTGCTATGACGACCTCGAACTGTGCCTTCAAGTCTGAGGCGAAAAAGAAATGGAAAAAAGTGTCTCATCGCTCTACCTCGTAGTTAGAGG
SEQ ID NO: 99 AATTACTGATGTTGTGATGAAGG
SEQ ID NO: 100 TATACCATAAGGATTTAAAGACT
SEQ ID NO: 101 GTCTTTACTCTCACCTTTCCACCTG
SEQ ID NO: 102 ATTTGAAGGTATCTCCGATAAGTAAAACGCATCAAAG
SEQ ID NO: 103 GTTTGAAGATATCTCCGATAAATAAGAAGCATCAAAG
SEQ ID NO: 104 TTGTTTTAATACCATATTTTTACATCACTCTCAAAC
SEQ ID NO: 105 AAAGAACGCTCGCTCAGTGTTCTGACCTTTCGAGCGCCTGTTCAGGGCGAAAACCCTGGGAGGCGCTCGAATCATAGGTGGGACAAGGGATTCGCGGCGAAAA
SEQ ID NO: 106 GTTTGAGAATGATGTAAAAATGTATGGTACACAGAAATGTTTTAATACCATATTTTTACATCACTCTCAAACATACATCTCTTGTTACTGTTTATCGTATCCAGATTAAATTTCACGTTTTT
SEQ ID NO: 107 GTCTAGAGGACAGAATTTTTCAACGGGTGTGCCAATGGCCACTTTCCAGGTGGCAAAGCCCGTTGAGCTTCTACGGAAGTGGCAC
SEQ ID NO: 108 MSIYQEFVNKYSLSKTLRFELIPQGKTLENIKARGLILDDEKRAKDYKKAKQIIDKYHQFFIEEILSSVCISEDLLQNYSDVYFKLKKSDDDNLQKDFKSAKDTIKKQISEYIKDSEKFKNLFNQNLIDAKKGQESDLILWLKQSKDNGIELFKANSDITDIDEALEIIKSFKGWTTYFKGFHENRKNVYSSNDIPTSIIYRIVDDNLPKFLENKAKYESLKDKAPEAINYEQIKKDLAEELTFDIDYKTSEVNQRVFSLDEVFEIANFNNYLNQSGITKFNTIIGGKFVNGENTKRKGINEYINLYSQQINDKTLKKYKMSVLFKQILSDTESKSFVIDKLEDDSDVVTTMQSFYEQIAAFKTVEEKSIKETLSLLFDDLKAQKLDLSKIYFKNDKSLTDLSQQVFDDYSVIGTAVLEYITQQIAPKNLDNPSKKEQELIAKKTEKAKYLSLETIKLALEEFNKHRDIDKQCRFEEILANFAAIPMIFDEIAQNKDNLAQISIKYQNQGKKDLLQASAEDDVKAIKDLLDQTNNLLHKLKIFHISQSEDKANILDKDEHFYLVFEECYFELANIVPLYNKIRNYITQKPYSDEKFKLNFENSTLANGWDKNKEPDNTAILFIKDDKYYLGVMNKKNNKIFDDKAIKENKGEGYKKIVYKLLPGANKMLPKVFFSAKSIKFYNPSEDILRIRNHSTHTKNGSPQKGYEKFEFNIEDCRKFIDFYKQSISKHPEWKDFGFRFSDTQRYNSIDEFYREVENQGYKLTFENISESYIDSVVNQGKLYLFQIYNKDFSAYSKGRPNLHTLYWKALFDERNLQDVVYKLNGEAELFYRKQSIPKKITHPAKEAIANKNKDNPKKESVFEYDLIKDKRFTEDKFFFHCPITINFKSSGANKFNDEINLLLKEKANDVHILSIDRGERHLAYYTLVDGKGNIIKQDTFNIIGNDRMKTNYHDKLAAIEKDRDSARKDWKKINNIKEMKEGYLSQVVHEIAKLVIEYNAIVVFEDLNFGFKRGRFKVEKQVYQKLEKMLIEKLNYLVFKDNEFDKTGGVLRAYQLTAPFETFKKMGKQTGIIYYVPAGFTSKICPVTGFVNQLYPKYESVSKSQEFFSKFDKICYNLDKGYFEFSFDYKNFGDKAAKGKWTIASFGSRLINFRNSDKNHNWDTREVYPTKELEKLLKDYSIEYGHGECIKAAICGESDKKFFAKLTSVLNTILQMRNSKTGTELDYLISPVADVNGNFFDSRQAPKNMPQDADANGAYHIGLKGLMLLGRIKNNQEGKKLNLVIKNEEYFEFVQNRNN
SEQ ID NO: 109 MSIYQEFVNKYSLSKTLRFELIPQGKTLENIKARGLILDDEKRAKDYKKAKQIIDKYHQFFIEEILSSVCISEDLLQNYSDVYFKLKKSDDDNLQKDFKSAKDTIKKQISEYIKDSEKFKNLFNQNLIDAKKGQESDLILWLKQSKDNGIELFKANSDITDIDEALEIIKSFKGWTTYFKGFHENRKNVYSSDDIPTSIIYRIVDDNLPKFLENKAKYESLKDKAPEAINYEQIKKDLAEELTFDIDYKTSEVNQRVFSLDEVFEIANFNNYLNQSGITKFNTIIGGKFVNGENTKRKGINEYINLYSQQINDKTLKKYKMSVLFKQILSDTESKSFVIDKLEDDSDVVTTMQSFYEQIAAFKTVEEKSIKETLSLLFDDLKAQKLDLSKIYFKNDKSLTDLSQQVFDDYSVIGTAVLEYITQQVAPKNLDNPSKKEQDLIAKKTEKAKYLSLETIKLALEEFNKHRDIDKQCRFEEILANFAAIPMIFDEIAQNKDNLAQISLKYQNQGKKDLLQASAEEDVKAIKDLLDQTNNLLHRLKIFHISQSEDKANILDKDEHFYLVFEECYFELANIVPLYNKIRNYITQKPYSDEKFKLNFENSTLANGWDKNKEPDNTAILFIKDDKYYLGVMNKKNNKIFDDKAIKENKGEGYKKIVYKLLPGANKMLPKVFFSAKSIKFYNPSEDILRIRNHSTHTKNGNPQKGYEKFEFNIEDCRKFIDFYKESISKHPEWKDFGFRFSDTQRYNSIDEFYREVENQGYKLTFENISESYIDSVVNQGKLYLFQIYNKDFSAYSKGRPNLHTLYWKALFDERNLQDVVYKLNGEAELFYRKQSIPKKITHPAKEAIANKNKDNPKKESVFEYDLIKDKRFTEDKFFFHCPITINFKSSGANKFNDEINLLLKEKANDVHILSIDRGERHLAYYTLVDGKGNIIKQDTFNIIGNDRMKTNYHDKLAAIEKDRDSARKDWKKINNIKEMKEGYLSQVVHEIAKLVIEHNAIVVFEDLNFGFKRGRFKVEKQVYQKLEKMLIEKLNYLVFKDNEFDKTGGVLRAYQLTAPFETFKKMGKQTGIIYYVPAGFTSKICPVTGFVNQLYPKYESVSKSQEFFSKFDKICYNLDKGYFEFSFDYKNFGDKAAKGKWTIASFGSRLINFRNSDKNHNWDTREVYPTKELEKLLKDYSIEYGHGECIKAAICGESDKKFFAKLTSVLNTILQMRNSKTGTELDYLISPVADVNGNFFDSRQAPKNMPQDADANGAYHIGLKGLMLLDRIKNNQEGKKLNLVIKNEEYFEFVQNRNN
SEQ ID NO: 110 MDKKYSIGLDIGTNSVGWAVITDEYKVPSKKFKVLGNTDRHSIKKNLIGALLFDSGETAEATRLKRTARRRYTRRKNRICYLQEIFSNEMAKVDDSFFHRLEESFLVEEDKKHERHPIFGNIVDEVAYHEKYPTIYHLRKKLVDSTDKADLRLIYLALAHMIKFRGHFLIEGDLNPDNSDVDKLFIQLVQTYNQLFEENPINASGVDAKAILSARLSKSRRLENLIAQLPGEKKNGLFGNLIALSLGLTPNFKSNFDLAEDAKLQLSKDTYDDDLDNLLAQIGDQYADLFLAAKNLSDAILLSDILRVNTEITKAPLSASMIKRYDEHHQDLTLLKALVRQQLPEKYKEIFFDQSKNGYAGYIDGGASQEEFYKFIKPILEKMDGTEELLVKLNREDLLRKQRTFDNGSIPHQIHLGELHAILRRQEDFYPFLKDNREKIEKILTFRIPYYVGPLARGNSRFAWMTRKSEETITPWNFEEVVDKGASAQSFIERMTNFDKNLPNEKVLPKHSLLYEYFTVYNELTKVKYVTEGMRKPAFLSGEQKKAIVDLLFKTNRKVTVKQLKEDYFKKIECFDSVEISGVEDRFNASLGTYHDLLKIIKDKDFLDNEENEDILEDIVLTLTLFEDREMIEERLKTYAHLFDDKVMKQLKRRRYTGWGRLSRKLINGIRDKQSGKTILDFLKSDGFANRNFMQLIHDDSLTFKEDIQKAQVSGQGDSLHEHIANLAGSPAIKKGILQTVKVVDELVKVMGRHKPENIVIEMARENQTTQKGQKNSRERMKRIEEGIKELGSQILKEHPVENTQLQNEKLYLYYLQNGRDMYVDQELDINRLSDYDVDHIVPQSFLKDDSIDNKVLTRSDKNRGKSDNVPSEEVVKKMKNYWRQLLNAKLITQRKFDNLTKAERGGLSELDKAGFIKRQLVETRQITKHVAQILDSRMNTKYDENDKLIREVKVITLKSKLVSDFRKDFQFYKVREINNYHHAHDAYLNAVVGTALIKKYPKLESEFVYGDYKVYDVRKMIAKSEQEIGKATAKYFFYSNIMNFFKTEITLANGEIRKRPLIETNGETGEIVWDKGRDFATVRKVLSMPQVNIVKKTEVQTGGFSKESILPKRNSDKLIARKKDWDPKKYGGFDSPTVAYSVLVVAKVEKGKSKKLKSVKELLGITIMERSSFEKNPIDFLEAKGYKEVKKDLIIKLPKYSLFELENGRKRMLASAGELQKGNELALPSKYVNFLYLASHYEKLKGSPEDNEQKQLFVEQHKHYLDEIIEQISEFSKRVILADANLDKVLSAYNKHRDKPIREQAENIIHLFTLTNLGAPAAFKYFDTTIDRKRYTSTKEVLDATLIHQSITGLYETRIDLSQLGGD
SEQ ID NO: 111 PKKKRKV
SEQ ID NO: 112 KRPAATKKAGQAKKKK
SEQ ID NO: 113 PAAKRVKLD
SEQ ID NO: 114 RQRRNELKRSP
SEQ ID NO: 115 NQSSNFGPMKGGNFGGRSSGPYGGGGQYFAKPRNQGGY
SEQ ID NO: 116 RMRIZFKNKGKDTAELRRRRVEVSVELRKAKKDEQILKRRNV
SEQ ID NO: 117 VSRKRPRP
SEQ ID NO: 118 PPKKARED
SEQ ID NO: 119 PQPKKKPL
SEQ ID NO: 120 SALIKKKKKMAP
SEQ ID NO: 121 DRLRR
SEQ ID NO: 122 PKQKKRK
SEQ ID NO: 123 RKLKKKIKKL
SEQ ID NO: 124 REKKKFLKRR
SEQ ID NO: 125 KRKGDEVDGVDEVAKKKSKK
SEQ ID NO: 126 RKCLQAGMNLEARKTKK
SEQATGGGTAAGATGTATTATCTGGGTTTGGATATAGGCACTAACTCTGTGGGATATGCAGTAACTGATCCCTID
CGTATCACTTGTTAAAGTTCAAAGGCGAACCCATGTGGGGAGCACATGTATTTGCTGCGGGTAATCAGANO: GTGCCGAAAGGCGATCTTTCAGAACATCCAGGAGGCGATTAGATAGGAGACAGCAAAGAGTAAAGCTT127GTGCAAGAGATCTTTGCTCCTGTCATTTCACCTATAGACCCTCGTTTTTTTATAAGATTGCACGAATCGGCTCTATGGAGAGACGATGTTGCCGAAACAGATAAACATATCTTTTTCAATGATCCCACTTATACAGACAAGGAATACTACTCCGACTACCCGACAATTCATCATTTGATCGTCGATCTTATGGAGAGCTCTGAAAAGCATGACCCCCGACTTGTCTATTTGGCTGTAGCTTGGTTAGTTGCTCATAGAGGTCATTTCTTGAATGAAGTAGATAAAGACAATATAGGTGATGTACTTTCTTTTGATGCTTTCTACCCGGAATTTTTGGCCTTTTTGTCAGACAATGGCGTCAGTCCCTGGGTCTGTGAGTCGAAGGCCCTTCAAGCTACTCTGCTGTCTAGGAATAGCGTCAACGACAAATATAAAGCATTAAAATCGCTGATATTCGGATCGCAAAAACCGGAAGATAACTTTGACGCTAACATCTCTGAAGATGGTTTAATCCAATTGCTGGCGGGTAAGAAAGTTAAAGTAAACAAACTATTCCCACAAGAGTCCAACGATGCTAGCTTTACGTTGAATGATAAAGAAGACGCTATTGAAGAAATTCTAGGTACTTTAACGCCTGACGAGTGCGAATGGATCGCTCATATTCGCAGATTGTTCGATTGGGCCATCATGAAACACGCGCTAAAGGATGGCAGGACGATATCTGAATCAAAAGTGAAGCTATACGAGCAGCATCATCATGACTTGACTCAGTTAAAGTACTTTGTGAAGACCTACCTAGCTAAAGAGTATGATGATATCTTCAGAAACGTAGACTCCGAGACAACTAAAAATTATGTAGCTTATTCTTACCATGTGAAGGAAGTGAAAGGCACATTACCAAAAAATAAAGCAACGCAAGAAGAATTTTGTAAATACGTCCTTGGCAAAGTCAAAAACATTGAATGTTCCGAAGCAGACAAGGTTGATTTTGATGAAATGATACAACGACTTACGGACAATTCTTTTATGCCAAAGCAAGTCTCAGGTGAAAATAGAGTAATACCATACCAGTTGTACTACTATGAATTAAAGACAATTTTAAACAAAGCCGCCTCATATCTACCTTTTTTGACACAATGCGGTAAAGATGCTATTTCTAACCAAGACAAATTACTGTCTATAATGACATTTCGCATACCATATTTCGTCGGCCCTTTAAGGAAAGATAATTCAGAACATGCCTGGTTGGAACGTAAAGCGGGTAAAATTTACCCGTGGAACTTTAATGATAAAGTAGATCTTGATAAATCGGAGGAAGCCTTTATCCGTAGGATGACCAATACTTGCACGTATTACCCAGGAGAAGACGTGTTACCATTAGATTCACTTATCTATGAAAAGTTTATGATCTTGAATGAGATAAACAATATTAGGATTGACGGATACCCCATTTCTGTTGATGTGAAACAACAAGTATTTGGTTTATTTGAGAAGAAAAGGCGAGTAACAGTTAAGGATATTCAAAATCTACTATTATCTCTTGGAGCGTTGGATAAACACGGTAAGCTGACTGGTATTGACACGACAATACACTCTAATTATAACACTTATCATCATTTTAAATCTCTTATGGAGCGGGGAGTATTGACCAGAGATGATGTGGAAAGAATAGTGGAAAGAATGACATATTCTGACGATACTAAGAGGGTCAGACTGTGGTTAAATAATAATTATGGAACTCTAACAGCTGACGATGTTAAGCATATCTCAAGACTCAGAAAACACGATTTCGGCCGTTTGTCTAAAATGTTTTTGACAGGATTGAAAGGTGTTCATAAGGAGACAGGCGAGAGAGC
AAGTATACTGGATTTTATGTGGAATACTAACGACAATTTAATGCAACTACTGTCCGAATGTTACACATTCTCGGATGAGATCACCAAATTACAAGAGGCCTACTACGCAAAAGCTCAATTATCGCTAAATGACTTCTTGGACTCTATGTATATATCAAACGCCGTTAAGAGACCTATTTATCGGACCTTAGCGGTAGTAAATGATATTAGAAAGGCATGCGGGACGGCACCTAAAAGAATTTTCATCGAGATGGCGCGAGATGGAGAGTCTAAGAAGAAAAGATCTGTGACTCGTAGAGAGCAAATTAAAAATCTCTATAGATCAATTCGTAAAGACTTTCAACAAGAAGTTGATTTTCTGGAAAAGATATTGGAAAATAAGAGTGACGGGCAGCTTCAGTCTGACGCTTTATATTTGTATTTTGCTCAATTAGGCAGAGACATGTACACAGGTGATCCAATCAAATTAGAACATATTAAAGACCAATCTTTTTACAACATTGATCATATTTATCCTCAATCGATGGTGAAAGATGACAGTTTGGATAACAAGGTACTAGTCCAAAGCGAAATCAATGGCGAAAAGAGTTCGCGCTATCCATTAGACGCAGCCATTAGAAACAAAATGAAGCCGTTGTGGGATGCCTACTATAATCATGGATTAATTTCTCTTAAGAAATACCAGCGTTTGACGAGATCTACTCCATTTACGGACGACGAGAAGTGGGATTTTATCAATCGTCAGCTAGTTGAAACTAGGCAATCTACTAAAGCTTTAGCAATATTGTTAAAGCGTAAGTTTCCAGATACTGAAATAGTTTACTCAAAGGCTGGACTATCCAGCGATTTTAGACATGAATTCGGCCTGGTTAAGAGTAGGAATATTAATGATCTACACCATGCTAAAGATGCCTTTCTCGCAATAGTTACTGGGAACGTTTATCATGAAAGATTTAATAGAAGATGGTTTATGGTTAACCAGCCATACTCTGTGAAAACTAAGACATTGTTTACCCATTCAATTAAGAATGGCAACTTTGTCGCTTGGAATGGAGAAGAAGATCTTGGACGTATCGTAAAGATGTTGAAACAAAACAAGAACACAATCCACTTCACCAGGTTTTCCTTTGATAGGAAGGAGGGATTGTTCGATATTCAACCTCTCAAAGCTTCTACCGGATTGGTTCCACGAAAAGCAGGGTTGGATGTTGTTAAATATGGAGGATACGATAAAAGCACTGCCGCGTATTATTTATTAGTACGTTTTACACTCGAGGATAAGAAGACTCAACACAAATTGATGATGATTCCTGTTGAAGGTCTCTACAAAGCACGTATTGACCATGATAAAGAGTTTTTAACAGATTATGCTCAGACCACGATCAGCGAAATTCTTCAAAAGGACAAGCAGAAAGTGATCAACATCATGTTCCCTATGGGCACGAGACATATCAAACTGAATTCGATGATTTCTATTGATGGATTCTATCTTTCTATTGGTGGGAAGAGTAGCAAAGGTAAGTCAGTACTATGTCATGCTATGGTGCCATTAATCGTCCCACACAAGATAGAATGTTATATCAAGGCTATGGAATCGTTTGCAAGAAAATTCAAAGAAAATAATAAATTGAGGATCGTTGAAAAGTTTGATAAAATAACTGTTGAAGATAACTTGAACTTATACGAGCTTTTTCTACAAAAGTTGCAACATAACCCATATAATAAATTTTTCTCTACACAATTTGATGTGTTGACGAACGGTAGAAGTACATTCACCAAATTGTCTCCAGAGGAGCAAGTCCAGACTTTACTTAATATACTGAGTATATTTAAAACTTGTCGTTCTTCTGGGTGTGATTTAAAATCAATAAATGGTTCCGCTCAAGCGGCTAGAATTATGATATCCGCTGATTTAACTGGCTTATCAAAAAAGTATTCAGATATTAGATTAGTTGAGCAAAGCGCATCAGGTCTATTTGTTTCAAAATCTCAAAATCTCT
TGGAATACTTGCCAAAAAAGAAAAGGAAAGTTTAGSEQATGAGTAGTTTAACAAAGTTTACCAATAAATATAGTAAGCAACTAACTATAAAGAACGAATTGATACCG IDGTCGGTAAGACTTTGGAAAACATAAAAGAAAATGGGTTGATTGATGGAGACGAGCAATTGAATGAGAA NO:TTATCAAAAAGCAAAGATAATAGTAGATGATTTTTTGAGAGACTTTATTAATAAAGCTCTAAATAACACT128 CAAATTGGTAACTGGAGAGAGCTAGCCGACGCCTTGAACAAGGAAGATGAGGATAATATTGAGAAATTACAAGATAAGATTAGAGGGATTATCGTGTCTAAGTTTGAGACTTTTGATCTGTTCAGTTCGTATTCGATTAAAAAGGACGAGAAAATCATCGATGATGATAACGATGTGGAAGAAGAGGAGCTAGACCTTGGGAAGAAGACATCTAGCTTCAAATACATATTCAAGAAAAATTTGTTCAAACTTGTCCTTCCTTCATATTTAAAAACAACAAATCAAGATAAGTTAAAAATCATTTCTTCCTTCGATAATTTTAGTACTTATTTTCGTGGTTTTTTCGAAAACAGGAAAAATATATTCACTAAAAAGCCTATATCTACCTCTATAGCTTATAGAATTGTTCACGATAATTTCCCAAAATTTCTAGATAATATCAGGTGTTTTAATGTTTGGCAAACCGAGTGTCCTCAGTTAATAGTCAAGGCCGACAACTACCTTAAAAGCAAGAATGTGATTGCAAAAGATAAGTCTTTGGCTAACTATTTTACAGTCGGTGCCTATGATTATTTTCTGAGTCAAAATGGTATCGATTTCTATAACAACATTATTGGCGGCTTACCAGCTTTTGCCGGGCATGAGAAGATTCAGGGTTTGAACGAATTTATCAATCAAGAATGTCAAAAGGATTCTGAATTAAAGTCTAAGCTCAAGAATAGGCACGCTTTCAAAATGGCAGTCTTATTCAAACAAATCCTTTCAGACAGAGAAAAGTCATTTGTGATTGACGAGTTCGAATCAGACGCTCAGGTAATTGATGCTGTTAAAAATTTTTACGCGGAACAATGCAAAGATAATAACGTCATATTTAATTTATTGAATCTGATCAAGAATATTGCTTTTTTGTCGGATGATGAGTTAGACGGCATTTTCATAGAGGGTAAATACCTGTCCTCTGTGTCTCAAAAATTGTATAGTGATTGGTCAAAGTTGAGAAATGATATTGAAGATTCGGCTAATTCTAAACAGGGTAACAAAGAATTAGCGAAGAAAATCAAAACTAACAAGGGTGATGTTGAAAAGGCTATAAGTAAGTACGAGTTCAGTTTATCTGAACTAAATTCAATTGTTCATGATAACACAAAATTTTCCGATCTTTTATCATGCACATTACATAAAGTTGCAAGTGAAAAATTAGTCAAAGTAAACGAAGGTGATTGGCCAAAACATCTAAAAAACAACGAGGAAAAACAGAAGATAAAAGAACCTCTTGACGCTTTATTGGAAATATACAATACTCTATTAATATTTAACTGTAAAAGTTTTAACAAAAATGGTAATTTCTATGTCGACTACGATCGCTGCATTAATGAGTTGTCCAGTGTTGTGTACTTGTATAATAAAACTCGTAATTATTGTACGAAAAAGCCGTACAACACTGACAAATTTAAGTTGAATTTCAACTCCCCACAACTGGGTGAGGGCTTCTCTAAAAGTAAAGAGAATGATTGCCTTACATTATTATTTAAAAAAGATGATAATTATTATGTCGGAATCATAAGAAAGGGGGCAAAGATCAACTTCGATGACACTCAGGCCATAGCAGACAACACAGATAACTGTATATTCAAAATGAATTATTTTTTGCTGAAGGATGCTAAAAAATTTATCCCCAAATGTTCAATACAATTAAAAGAGGTTAAGGCCCATTTCAAAAAGTCGGAAGATGACTATATTTTGT
CCGATAAGGAAAAATTCGCTAGTCCGCTTGTTATTAAAAAATCCACATTTCTTCTCGCTACGGCTCATGTGAAAGGAAAGAAGGGCAATATTAAGAAATTTCAGAAAGAATACTCCAAAGAAAATCCTACGGAGTATAGAAATAGTCTGAACGAATGGATAGCATTCTGCAAAGAGTTCTTGAAGACCTATAAAGCTGCCACCATCTTTGATATTACAACTTTGAAAAAGGCCGAGGAATACGCTGACATTGTGGAATTCTATAAGGATGTAGATAATCTTTGTTACAAGTTAGAATTTTGCCCTATCAAAACTTCTTTTATCGAAAATCTTATAGATAATGGCGATTTATACCTGTTTAGAATTAATAACAAGGACTTTTCTTCAAAAAGTACAGGCACGAAAAACTTACACACATTATACTTGCAGGCTATATTTGACGAGCGAAACTTAAACAACCCCACGATAATGTTGAATGGAGGTGCAGAGTTATTCTACAGAAAAGAATCTATAGAACAGAAAAATCGGATCACGCACAAAGCCGGTAGTATCTTAGTGAATAAAGTGTGCAAAGATGGTACAAGTCTAGATGACAAAATCCGTAACGAAATTTACCAGTATGAAAACAAATTCATTGATACTCTTTCGGACGAAGCTAAAAAGGTTCTGCCAAACGTTATTAAGAAAGAGGCTACGCATGATATAACAAAAGATAAACGTTTCACTAGCGACAAATTCTTCTTTCATTGTCCTTTAACAATCAACTACAAGGAAGGTGACACCAAACAATTTAATAATGAAGTGCTCTCATTCCTTAGAGGTAACCCCGATATCAATATTATCGGCATTGATAGAGGAGAAAGAAACCTAATCTATGTAACAGTCATTAACCAAAAAGGCGAAATATTGGATAGCGTCTCCTTCAATACTGTCACCAATAAGTCATCGAAGATAGAACAAACTGTTGATTACGAAGAAAAATTGGCCGTTAGAGAAAAGGAACGTATCGAAGCGAAGAGATCTTGGGATAGCATATCCAAGATTGCCACCTTGAAGGAGGGTTATCTAAGCGCGATCGTACATGAAATCTGCTTATTAATGATTAAGCATAATGCTATTGTCGTGTTAGAAAACCTGAATGCCGGTTTTAAAAGGATTAGAGGTGGTTTGTCAGAAAAGTCAGTATATCAAAAGTTTGAAAAGATGCTTATTAATAAACTCAACTACTTCGTTAGCAAGAAAGAAAGTGATTGGAATAAACCGTCAGGTTTGCTCAATGGTCTTCAGTTAAGTGATCAATTTGAGTCTTTCGAAAAATTAGGAATTCAAAGTGGATTCATTTTTTATGTACCAGCCGCGTACACTTCAAAAATTGACCCTACGACCGGATTTGCCAACGTCTTGAATTTGTCCAAGGTCAGAAATGTTGACGCCATCAAAAGTTTTTTTAGCAACTTCAATGAAATCTCTTATTCCAAAAAGGAAGCCCTTTTCAAGTTTTCTTTTGACCTAGACTCGTTATCGAAGAAAGGATTTTCATCTTTCGTAAAGTTTAGCAAGTCCAAGTGGAATGTATACACATTCGGCGAGAGAATTATCAAGCCCAAGAACAAACAGGGCTATAGAGAAGACAAGAGAATCAACTTGACTTTTGAGATGAAAAAATTACTCAACGAATACAAGGTTTCATTTGATTTGGAGAACAACTTGATTCCCAATTTGACATCAGCTAACTTGAAGGATACGTTCTGGAAGGAGTTATTCTTTATATTCAAAACGACATTACAACTGCGTAATAGTGTTACAAACGGTAAAGAAGATGTATTAATCTCACCTGTAAAGAATGCCAAAGGAGAATTTTTCGTATCCGGTACTCACAATAAGACACTACCACAGGATTGCGACGCTAACGGTGCGTATCATATTGCGTTGAAAGGATTAATGATACTTGAAAGAAATAACCTTGTTCGCGAAGAAAAAGACACCAAGAAGATCATGGCT
ATTAGCAATGTTGATTGGTTTGAATACGTGCAAAAGAGGAGAGGTGTTTTGTAA SEQATGAACAATTATGACGAGTTCACAAAGCTATACCCTATCCAAAAAACTATCAGGTTCGAATTGAAACCA IDCAAGGGAGAACAATGGAACATCTGGAGACATTCAACTTTTTTGAAGAGGACAGAGACAGAGCGGAGAA NO:ATACAAAATTTTAAAAGAGGCCATCGATGAATATCACAAAAAGTTTATCGACGAGCATTTAACAAACAT129GTCTTTGGACTGGAATTCACTTAAACAAATTTCTGAGAAATATTATAAGTCTCGGGAGGAAAAAGACAAAAAGGTCTTTTTGTCCGAGCAAAAGAGAATGAGACAAGAAATTGTCTCGGAGTTTAAAAAAGATGATCGGTTCAAAGATTTGTTTAGCAAGAAATTGTTTTCTGAATTGTTGAAGGAGGAGATATACAAGAAAGGCAACCATCAAGAAATAGATGCTTTGAAATCGTTTGACAAGTTCAGCGGTTACTTCATTGGTTTACATGAAAATAGGAAGAACATGTATAGCGACGGCGATGAGATCACCGCTATATCGAATAGAATCGTTAACGAAAATTTTCCGAAATTTTTGGATAATTTGCAAAAATACCAGGAAGCTAGGAAAAAGTACCCTGAATGGATAATAAAGGCGGAATCAGCTTTGGTGGCTCACAACATAAAGATGGATGAAGTCTTCTCGCTGGAATATTTTAACAAAGTATTAAATCAGGAAGGAATCCAAAGATACAACTTAGCCTTGGGTGGATACGTAACCAAATCAGGTGAGAAAATGATGGGCTTAAATGATGCACTTAATCTAGCTCACCAATCCGAAAAGTCCTCTAAAGGGAGGATACACATGACACCATTGTTTAAGCAAATCCTTTCGGAGAAAGAATCTTTTTCATATATCCCCGATGTTTTCACTGAGGATAGTCAATTGTTGCCCAGCATTGGTGGATTTTTTGCACAAATAGAAAATGATAAAGATGGTAACATCTTCGATAGAGCCTTGGAATTGATAAGCTCCTATGCAGAATACGATACGGAACGAATATACATTAGACAAGCTGACATCAACAGAGTAAGCAATGTTATTTTTGGTGAGTGGGGAACTTTAGGTGGATTAATGCGGGAGTACAAAGCTGACTCAATCAATGATATTAATTTGGAACGTACGTGCAAAAAAGTCGATAAGTGGCTTGATAGTAAGGAGTTTGCTCTGTCGGATGTACTAGAAGCAATTAAGAGAACAGGAAACAATGATGCATTTAATGAATATATTAGTAAAATGAGGACGGCTAGAGAAAAGATAGACGCCGCACGTAAGGAAATGAAGTTTATTTCCGAGAAAATATCTGGCGATGAAGAGTCGATTCACATCATCAAGACCCTACTCGATTCTGTTCAGCAATTTCTCCATTTTTTTAACCTCTTCAAAGCAAGACAAGACATTCCCTTAGATGGGGCTTTTTATGCCGAATTTGATGAAGTTCATTCAAAGTTGTTTGCTATTGTTCCTCTTTACAATAAGGTCCGTAATTACCTTACTAAAAATAACTTGAACACCAAGAAAATAAAGTTAAACTTCAAGAATCCGACTCTTGCCAACGGGTGGGATCAGAATAAAGTTTATGATTATGCTAGCTTAATATTTCTAAGAGATGGGAATTATTACTTAGGAATCATCAATCCAAAGCGTAAGAAAAACATTAAATTTGAACAAGGGTCAGGCAATGGCCCATTCTATAGAAAAATGGTGTATAAGCAAATACCAGGACCTAACAAGAACTTGCCTCGCGTATTTTTAACTTCAACAAAGGGTAAAAAAGAATATAAACCAAGCAAAGAAATTATTGAAGGTTACGAAGCAGATAAACACATCAGAGGTGATAAGTTCGATCTGGATTTCTGCCATAAATTGATTGACTTTTTTAAGGAATCTATAGAAAAACATAAG
GACTGGTCCAAATTTAATTTCTACTTCTCACCTACAGAAAGTTATGGTGACATTTCAGAATTTTATTTAGACGTTGAGAAACAAGGATATAGGATGCATTTTGAAAATATTTCAGCGGAAACCATCGACGAATACGTTGAGAAGGGTGATTTATTCTTGTTCCAAATTTACAATAAAGACTTCGTTAAAGCTGCAACCGGAAAGAAGGATATGCATACCATATATTGGAACGCTGCATTCTCGCCAGAAAACTTACAAGATGTCGTTGTAAAGCTTAATGGAGAAGCTGAGCTGTTCTATAGAGACAAGAGTGATATAAAAGAGATTGTGCATCGGGAAGGTGAAATTCTGGTGAACAGAACTTACAATGGTCGTACACCCGTTCCAGACAAAATACATAAAAAACTGACCGATTATCATAATGGTAGGACAAAGGACTTGGGCGAGGCCAAGGAGTACCTCGATAAAGTTAGATATTTCAAGGCACACTATGATATTACGAAAGACAGGAGATATTTAAACGATAAAATTTACTTTCATGTCCCTTTGACCCTTAACTTTAAAGCTAATGGTAAAAAGAATTTGAACAAAATGGTAATTGAGAAGTTTTTATCGGACGAAAAAGCTCACATAATCGGAATCGACCGCGGAGAGAGAAATTTACTGTATTATAGTATCATCGACAGAAGTGGAAAGATTATTGATCAGCAATCTTTGAACGTCATTGATGGGTTTGACTATCGGGAAAAGTTAAATCAAAGGGAAATTGAAATGAAGGATGCGAGACAATCATGGAATGCCATTGGTAAAATTAAAGATCTCAAGGAGGGGTACTTATCAAAAGCTGTACACGAGATAACTAAAATGGCTATCCAATATAATGCAATTGTTGTAATGGAAGAATTGAATTATGGTTTTAAACGCGGCAGGTTTAAAGTCGAAAAACAAATATACCAAAAGTTTGAAAACATGTTAATTGATAAGATGAACTATCTTGTTTTCAAAGATGCACCTGATGAGAGTCCTGGCGGTGTGCTGAACGCCTATCAATTAACAAACCCATTAGAGTCCTTTGCTAAACTGGGTAAACAAACTGGCATTCTATTTTATGTTCCAGCCGCTTACACCTCAAAGATCGATCCAACGACCGGTTTTGTAAACTTATTTAATACTTCTTCCAAAACAAACGCGCAAGAACGCAAAGAATTCCTACAAAAATTTGAATCAATATCCTATAGCGCAAAAGATGGAGGTATATTCGCTTTCGCTTTTGACTACAGAAAGTTTGGCACTTCCAAGACAGATCATAAAAATGTGTGGACCGCTTATACCAACGGAGAAAGGATGCGTTATATTAAAGAAAAAAAGAGGAACGAACTATTTGATCCATCGAAAGAAATTAAAGAAGCTTTGACAAGCAGCGGAATCAAATATGATGGAGGTCAAAACATACTTCCAGATATTCTCAGATCTAATAATAACGGTCTTATTTACACGATGTATTCATCTTTTATCGCTGCCATCCAAATGCGTGTGTATGATGGCAAGGAAGATTATATTATATCTCCTATTAAAAATTCAAAGGGTGAATTTTTTCGCACGGATCCAAAAAGAAGAGAGCTTCCAATTGACGCCGATGCTAACGGTGCTTACAATATTGCATTGCGTGGTGAACTTACTATGAGAGCCATCGCCGAAAAGTTTGATCCGGACAGTGAAAAAATGGCGAAATTGGAGCTAAAGCACAAGGATTGGTTTGAATTCATGCAGACCCGTGGCGATTGA SEQATGACTAAAACGTTCGACTCCGAGTTTTTTAATCTCTATTCCTTGCAAAAGACCGTTAGGTTTGAATTGAID AACCAGTTGGTGAAACTGCCTCATTTGTCGAAGACTTTAAAAACGAGGGATTGAAAAGAGTGGTTAGTGNO: 
AAGATGAAAGAAGGGCAGTAGACTATCAAAAGGTTAAAGAAATCATTGACGATTACCACAGAGATTTT130ATAGAAGAATCTCTGAACTATTTTCCAGAGCAGGTTTCAAAAGATGCTCTAGAGCAAGCGTTTCATTTGTATCAAAAGTTGAAAGCAGCGAAGGTGGAAGAAAGGGAAAAAGCTTTAAAAGAATGGGAAGCATTACAGAAAAAATTGCGAGAAAAAGTCGTCAAATGTTTCAGCGACTCTAATAAAGCTCGCTTTTCTAGAATCGATAAAAAAGAATTGATTAAGGAAGATTTAATAAATTGGCTGGTAGCACAAAACAGAGAGGATGATATTCCTACTGTTGAAACGTTCAATAATTTTACTACTTACTTCACTGGTTTCCATGAGAACAGGAAGAATATTTACTCTAAAGATGATCACGCTACTGCTATAAGTTTTAGGTTGATTCACGAAAACTTGCCTAAATTTTTTGACAATGTCATCAGTTTTAACAAGTTGAAAGAAGGTTTCCCGGAATTAAAATTCGACAAAGTTAAAGAAGATTTAGAAGTAGATTACGACTTGAAGCATGCGTTTGAAATTGAATATTTCGTTAATTTCGTCACACAAGCTGGTATCGACCAATATAATTACCTGCTTGGAGGCAAAACTCTAGAAGACGGTACGAAGAAACAAGGAATGAATGAACAGATTAATTTATTTAAGCAACAACAAACTCGCGATAAAGCTAGACAGATTCCAAAACTGATTCCACTTTTCAAACAGATTCTATCTGAGAGAACTGAATCTCAGAGTTTTATCCCTAAGCAGTTCGAGTCTGATCAGGAACTATTCGATTCCCTGCAGAAATTGCATAACAACTGTCAAGATAAGTTTACCGTTTTGCAACAGGCGATCTTGGGATTGGCTGAGGCAGATCTTAAAAAGGTCTTTATTAAAACTAGTGATCTAAACGCATTGTCTAACACTATTTTTGGAAATTATTCTGTGTTCTCAGACGCGCTCAATTTATATAAAGAGTCGCTAAAAACTAAAAAGGCTCAAGAAGCTTTTGAAAAGTTGCCTGCACATAGTATTCATGATTTAATCCAATACTTAGAACAATTTAATTCGTCTCTCGATGCTGAAAAGCAACAGTCTACCGATACTGTATTAAACTACTTTATTAAAACCGACGAATTATATAGTCGTTTCATTAAATCCACCTCTGAGGCATTCACCCAAGTACAACCTCTCTTTGAACTGGAAGCTTTGAGCTCCAAAAGAAGACCCCCAGAAAGTGAAGATGAGGGGGCTAAAGGCCAAGAAGGTTTCGAACAAATTAAGAGAATCAAAGCTTATCTAGACACTCTAATGGAGGCTGTCCACTTTGCTAAGCCTTTGTATCTTGTCAAGGGTAGAAAGATGATAGAGGGTCTAGACAAGGATCAAAGCTTCTACGAAGCGTTTGAAATGGCCTACCAGGAGTTGGAGTCTTTAATCATCCCCATTTACAATAAGGCCAGATCTTACCTGTCTAGGAAGCCATTTAAAGCGGATAAATTCAAAATTAATTTTGACAATAATACACTTCTATCTGGGTGGGATGCTAACAAGGAGACGGCTAACGCCAGCATATTGTTTAAGAAGGATGGTTTATACTACCTGGGAATCATGCCAAAAGGCAAAACTTTCTTGTTCGATTATTTCGTTAGTTCAGAAGATTCTGAAAAGTTGAAACAACGGAGACAGAAAACCGCAGAGGAAGCGCTCGCACAGGATGGAGAATCCTATTTTGAAAAAATACGGTATAAACTCCTACCAGGTGCTAGTAAGATGTTGCCAAAGGTATTTTTTAGCAATAAAAATATTGGGTTTTACAATCCCTCAGATGATATTCTACGAATTCGGAATACGGCCTCTCATACTAAGAATGGTACTCCCCAGAAGGGTCATTCCAAGGTAGAATTTAACTTGAATGACTGTCACAAAATGATTGATTTTTTTAAA
TCTTCCATACAGAAACATCCCGAGTGGGGATCCTTTGGTTTCACTTTTTCTGATACGTCGGACTTTGAAGATATGAGTGCTTTCTACCGAGAAGTTGAAAATCAAGGTTACGTTATAAGTTTTGATAAAATAAAAGAAACTTACATTCAGTCTCAAGTTGAGCAAGGTAACTTATATTTATTTCAAATTTACAACAAAGATTTTAGTCCGTATTCAAAGGGAAAGCCAAACCTGCACACTTTATACTGGAAAGCTCTGTTTGAAGAGGCTAATTTGAATAACGTAGTGGCTAAGCTAAACGGCGAAGCAGAAATCTTTTTCAGAAGACACAGTATCAAAGCATCTGATAAAGTGGTACATCCTGCTAATCAAGCTATAGATAATAAGAATCCCCATACTGAGAAGACGCAGTCCACATTTGAATATGACTTGGTCAAAGACAAAAGATATACCCAAGACAAATTTTTTTTTCATGTACCGATATCTTTAAACTTTAAGGCTCAGGGCGTTTCAAAGTTTAATGATAAGGTAAATGGATTCTTAAAGGGCAATCCCGACGTTAATATAATCGGTATAGATCGAGGTGAGAGACATCTTTTATACTTTACCGTGGTGAATCAAAAAGGAGAAATATTAGTGCAAGAGTCCTTGAATACATTAATGTCTGACAAGGGTCATGTCAACGATTATCAACAGAAATTGGACAAGAAGGAACAGGAAAGGGACGCTGCCAGGAAGTCCTGGACGACAGTAGAAAATATTAAAGAATTAAAAGAAGGTTATTTATCACATGTGGTTCATAAACTTGCACATTTAATCATCAAATATAACGCAATAGTGTGCTTGGAAGATCTTAATTTTGGCTTCAAGAGGGGTAGGTTCAAGGTCGAAAAACAGGTCTACCAGAAGTTCGAGAAAGCTCTGATCGATAAATTGAATTATCTTGTTTTCAAAGAAAAAGAATTAGGAGAAGTTGGTCATTATCTTACAGCATACCAACTCACTGCACCATTTGAAAGCTTCAAAAAGCTAGGCAAGCAATCTGGGATTTTGTTCTATGTTCCGGCTGATTATACATCAAAGATAGATCCTACCACAGGCTTTGTAAATTTTTTAGATCTTAGGTACCAATCCGTTGAAAAAGCTAAACAGTTGCTGTCCGATTTTAATGCGATAAGATTTAATAGTGTTCAGAATTATTTTGAGTTCGAAATTGATTATAAAAAATTGACACCAAAACGTAAAGTAGGAACACAATCTAAATGGGTTATTTGTACCTATGGAGATGTTAGATACCAAAACAGAAGAAATCAGAAAGGTCACTGGGAAACTGAAGAAGTTAACGTTACTGAAAAACTTAAAGCTCTATTTGCGAGCGATTCAAAAACGACGACGGTGATCGATTATGCAAATGATGATAACCTTATTGATGTAATTCTGGAACAAGATAAGGCATCATTTTTTAAAGAACTACTATGGTTGTTAAAGCTAACCATGACCCTAAGGCACTCCAAGATAAAGTCAGAGGATGATTTTATCCTCTCTCCAGTGAAAAACGAACAAGGTGAGTTTTACGACTCAAGAAAGGCGGGTGAAGTCTGGCCTAAGGATGCTGATGCCAATGGAGCTTATCACATCGCTCTGAAGGGGCTATGGAACTTACAGCAAATTAACCAATGGGAAAAAGGTAAAACTTTAAACCTCGCCATAAAGAACCAGGATTGGTTCAGCTTTATCCAAGAAAAACCATATCAAGAATAA 
SEQATGCACACAGGAGGTCTACTCTCGATGGATGCTAAGGAATTTACCGGTCAATATCCGCTGTCCAAAACTTIDTGCGTTTTGAGCTTAGACCTATTGGCCGAACGTGGGATAACCTAGAGGCTTCTGGTTATTTGGCGGAAGANO:TAGACATAGAGCTGAGTGTTATCCCCGAGCTAAAGAATTGCTGGATGATAACCACAGGGCGTTCCTGAA131TAGAGTTCTACCGCAAATCGATATGGATTGGCATCCAATTGCTGAAGCTTTCTGCAAGGTGCACAAAAATCCAGGTAATAAAGAATTGGCTCAGGATTATAATTTGCAGCTTAGTAAGAGAAGAAAAGAAATTTCCGCTTATTTGCAGGATGCTGATGGATACAAGGGGTTGTTCGCGAAACCTGCCCTGGACGAAGCTATGAAAATAGCTAAGGAAAACGGCAATGAATCTGATATTGAAGTTTTGGAAGCCTTCAATGGATTTTCCGTTTATTTCACTGGTTATCATGAGAGTAGGGAGAATATATACTCAGACGAAGATATGGTATCCGTCGCCTATCGCATAACTGAAGATAATTTTCCAAGGTTCGTGTCGAACGCGTTAATTTTTGATAAACTAAATGAATCGCACCCGGATATTATTTCGGAAGTGTCCGGTAATCTGGGGGTAGACGATATTGGTAAATATTTTGATGTGTCCAACTACAATAATTTCCTTAGTCAAGCAGGAATTGATGACTACAACCATATTATAGGAGGGCATACAACTGAAGACGGTCTCATTCAAGCTTTTAACGTAGTGTTAAACCTAAGGCACCAAAAAGACCCAGGTTTTGAGAAAATTCAATTTAAGCAACTCTACAAGCAGATACTGAGCGTTAGGACTAGTAAGTCATATATCCCAAAGCAATTCGATAACTCAAAGGAAATGGTCGACTGTATATGCGACTACGTCTCAAAAATAGAAAAATCTGAAACAGTAGAAAGAGCTCTGAAATTGGTAAGAAATATATCTTCTTTTGATTTAAGAGGTATTTTCGTAAATAAAAAAAACCTTCGAATTTTGTCTAATAAGTTAATTGGAGACTGGGACGCAATAGAGACAGCTTTGATGCACAGTTCCAGCAGTGAAAACGATAAGAAATCAGTGTATGACTCTGCAGAGGCATTCACCCTTGATGATATCTTCAGTTCTGTGAAAAAGTTCAGCGACGCCTCCGCTGAGGATATAGGAAACCGCGCTGAAGACATATGTCGTGTTATCTCAGAAACAGCTCCTTTCATTAACGACTTAAGGGCTGTAGATTTGGATTCTTTAAATGATGACGGCTATGAAGCGGCCGTGTCTAAAATACGGGAATCTCTTGAACCCTACATGGATCTATTTCACGAATTGGAGATCTTTAGCGTGGGTGATGAGTTTCCTAAATGTGCTGCCTTTTATAGCGAGTTGGAAGAGGTCTCAGAACAACTGATTGAAATCATTCCTTTATTTAACAAAGCAAGAAGTTTTTGCACAAGGAAAAGGTATTCAACCGACAAAATCAAAGTCAATTTAAAATTCCCTACTCTGGCAGATGGATGGGATCTAAATAAAGAAAGGGATAACAAAGCCGCAATTCTAAGAAAAGACGGTAAATACTACCTGGCAATTTTAGACATGAAGAAAGATCTCAGTAGTATTCGTACGAGCGATGAGGACGAGTCTTCTTTTGAAAAGATGGAATATAAATTGCTCCCTTCTCCTGTGAAAATGCTTCCAAAAATTTTTGTTAAATCGAAAGCCGCCAAAGAAAAGTACGGGTTGACCGATAGAATGTTAGAATGCTACGATAAAGGTATGCATAAGTCGGGTAGTGCTTTTGATTTGGGTTTTTGTCATGAATTGATCGATTACTATAAGCGCTGCATTGCCGAGTACCCAGGCTGGGATGTTTTCGACTTTAAATTTCGTGAGACAAGCGATTACGGATCCATGAAAGAATTTAATGAAGACGTC
GCTGGCGCAGGTTACTATATGTCACTTAGAAAGATTCCATGTTCCGAAGTTTATCGTTTACTGGACGAGAAGTCAATTTACTTGTTTCAAATATATAATAAGGATTATAGCGAAAACGCACATGGGAATAAGAATATGCATACGATGTATTGGGAGGGCTTGTTCTCACCACAAAATTTGGAATCACCAGTCTTCAAATTGTCCGGAGGCGCAGAACTTTTTTTCAGAAAGTCATCTATTCCTAATGACGCTAAAACGGTACATCCGAAAGGTTCAGTTCTTGTTCCCAGAAACGACGTCAATGGTAGAAGAATACCAGACTCGATCTACAGAGAGTTGACAAGGTATTTTAACCGTGGGGATTGCAGGATCAGTGATGAAGCTAAGTCTTACCTGGACAAGGTCAAGACAAAAAAAGCGGACCATGACATTGTTAAGGATAGAAGATTTACTGTAGATAAGATGATGTTCCATGTTCCGATTGCCATGAATTTTAAAGCTATAAGTAAACCAAATCTTAATAAGAAAGTTATTGATGGCATAATAGATGATCAAGATTTGAAAATCATCGGTATCGATCGTGGTGAGAGAAATCTTATTTATGTGACCATGGTCGATAGGAAGGGGAATATATTGTATCAAGACAGTCTTAATATTTTAAATGGATACGATTACCGCAAAGCTTTAGACGTGAGGGAATATGATAACAAAGAAGCTAGAAGGAATTGGACTAAAGTAGAAGGTATTAGAAAAATGAAAGAAGGTTATTTATCTTTAGCTGTTAGTAAATTGGCCGATATGATCATCGAAAATAATGCTATAATCGTAATGGAAGATTTGAATCACGGGTTTAAGGCAGGTCGTTCCAAAATTGAAAAGCAGGTGTATCAAAAATTCGAATCAATGTTAATCAACAAGTTAGGATACATGGTGCTAAAAGACAAGTCCATTGACCAGTCTGGTGGAGCCCTTCATGGTTACCAATTAGCCAATCATGTTACGACCTTAGCTAGCGTGGGTAAACAATGTGGAGTAATTTTTTACATACCTGCAGCTTTTACTTCGAAGATTGATCCCACCACGGGCTTTGCTGATTTATTCGCTCTCTCTAATGTGAAGAATGTCGCTTCTATGAGAGAGTTCTTCTCCAAAATGAAGTCAGTAATATATGACAAGGCGGAAGGCAAATTCGCCTTTACATTTGATTATTTGGATTATAACGTTAAAAGCGAATGTGGACGTACCTTATGGACTGTGTATACAGTTGGTGAACGCTTCACCTACTCTAGAGTAAACCGAGAGTATGTTCGGAAAGTCCCAACAGATATCATCTATGATGCATTACAAAAAGCTGGTATTAGCGTCGAAGGTGACCTTAGAGATAGAATCGCGGAAAGCGACGGTGACACATTAAAGTCTATATTCTACGCTTTTAAATACGCGTTGGATATGAGAGTCGAAAACAGAGAGGAAGACTATATACAGTCACCTGTGAAGAATGCTTCTGGTGAGTTCTTTTGTTCAAAAAACGCCGGAAAGTCTTTGCCGCAGGATTCAGATGCAAATGGTGCCTATAATATAGCTCTGAAAGGGATCCTACAACTCAGAATGTTGAGCGAACAATACGATCCAAATGCAGAATCGATTAGATTGCCACTTATAACTAACAAGGCATGGTTAACTTTTATGCAATCCGGTATGAAAACTTGGAAGAATTAA SEQATGGATTCTCTTAAGGATTTCACTAATTTATATCCAGTCTCGAAAACATTGCGGTTCGAATTGAAACCAGID 
TTGGGAAAACTCTAGAAAACATTGAAAAAGCCGGTATATTGAAAGAAGATGAACACAGAGCGGAATCCNO:TACCGCCGGGTAAAAAAGATAATTGACACATACCATAAAGTGTTTATTGACAGCTCCTTAGAGAACATG132GCTAAAATGGGGATAGAAAATGAAATCAAGGCTATGCTGCAGTCTTTTTGTGAACTCTATAAGAAAGACCACAGGACAGAAGGAGAAGATAAAGCTCTTGATAAAATTAGAGCTGTTCTTAGAGGTTTAATCGTTGGGGCTTTCACTGGTGTATGTGGAAGACGAGAAAACACAGTACAAAATGAAAAGTACGAGAGTTTGTTCAAAGAAAAATTGATAAAGGAAATTTTGCCAGATTTCGTGTTGTCCACCGAGGCTGAGTCTCTTCCATTCAGCGTTGAAGAAGCAACAAGGAGCTTAAAAGAGTTTGACTCATTCACTTCTTATTTTGCTGGTTTTTACGAAAATAGAAAGAATATTTATTCCACGAAACCGCAAAGTACTGCGATAGCCTACAGATTAATTCATGAAAACTTGCCTAAATTTATAGATAATATTTTGGTCTTCCAGAAGATTAAAGAACCAATCGCTAAAGAACTTGAGCACATAAGAGCAGATTTTAGCGCAGGCGGATATATCAAAAAAGATGAACGGCTAGAAGACATATTCTCATTAAATTACTACATTCATGTCCTTTCTCAAGCTGGTATAGAAAAATATAATGCTTTAATCGGGAAGATAGTGACGGAAGGTGATGGTGAAATGAAAGGTCTTAATGAACATATTAACTTATATAACCAACAGAGGGGTCGAGAGGATAGGTTGCCCTTGTTTAGGCCTCTATACAAGCAAATCCTGTCCGATAGAGAGCAATTGTCTTATTTACCTGAATCATTTGAAAAAGATGAAGAGCTGCTTAGAGCACTTAAGGAATTTTACGATCACATCGCCGAAGACATCTTGGGTAGAACACAGCAATTGATGACTTCAATTTCTGAATACGACTTGTCCCGTATTTATGTCAGAAATGATTCTCAACTTACAGACATCTCGAAGAAAATGCTAGGAGATTGGAACGCCATTTATATGGCTAGAGAACGAGCCTACGACCACGAACAGGCTCCTAAACGTATTACTGCTAAATACGAACGTGATAGAATCAAGGCCTTAAAAGGTGAAGAGTCAATTTCATTGGCGAATCTGAACAGCTGTATAGCTTTCTTGGACAATGTAAGGGATTGTCGAGTTGACACATACCTATCAACTTTGGGGCAGAAAGAGGGTCCTCATGGCTTAAGTAACTTGGTGGAAAACGTCTTCGCCTCATATCATGAAGCAGAACAGTTATTGTCGTTTCCTTACCCCGAAGAGAACAACCTTATTCAGGACAAAGACAATGTAGTTTTGATCAAAAACCTATTGGATAATATAAGTGATTTACAACGTTTCCTTAAACCTTTGTGGGGAATGGGCGATGAACCTGACAAAGACGAAAGGTTTTACGGTGAATACAACTATATTAGAGGAGCGCTTGACCAGGTAATACCTTTGTACAATAAAGTAAGGAACTACTTGACTCGTAAACCATATTCTACTAGAAAAGTTAAATTGAACTTTGGTAATTCACAGCTGCTGAGTGGTTGGGATCGTAATAAAGAAAAAGATAACTCCTGTGTTATCTTGCGAAAAGGACAAAACTTTTACTTGGCAATTATGAACAACCGTCACAAAAGGTCCTTCGAGAACAAAGTTCTGCCTGAATACAAAGAAGGTGAACCATATTTTGAAAAAATGGACTATAAATTCCTGCCAGATCCTAATAAAATGTTGCCTAAGGTCTTCTTGTCTAAAAAAGGTATAGAAATATATAAACCATCCCCGAAGTTGCTGGAGCAATATGGTCATGGAACGCACAAAAAAGGTGACACTTTTAGTATGGATGACTTGCACGAGTTGATTGATTTTTTTAAACATTCC
ATTGAAGCGCACGAAGATTGGAAACAATTTGGTTTCAAGTTCTCTGACACAGCCACTTACGAAAATGTATCGTCCTTTTATAGAGAAGTGGAAGATCAGGGTTATAAACTGTCATTCCGTAAGGTTAGTGAAAGCTATGTGTACTCGTTGATCGATCAAGGGAAGCTTTATCTTTTTCAAATCTATAATAAAGATTTCTCTCCTTGTTCAAAGGGCACACCTAATCTTCATACACTATACTGGAGAATGCTTTTCGATGAAAGAAATTTGGCTGATGTGATCTATAAATTAGACGGTAAAGCTGAGATTTTTTTCAGAGAGAAATCCCTGAAAAACGACCATCCAACTCATCCGGCAGGTAAACCGATTAAAAAGAAATCCCGGCAAAAAAAGGGCGAAGAGAGTTTATTCGAGTATGATTTAGTTAAGGACAGACATTATACAATGGACAAATTTCAATTTCATGTGCCCATTACTATGAACTTTAAGTGTAGTGCAGGGTCTAAGGTTAATGATATGGTAAACGCACATATTAGAGAAGCTAAAGATATGCACGTCATCGGTATTGATCGCGGAGAAAGAAATTTACTTTACATTTGCGTTATCGATTCTAGGGGCACCATCTTGGATCAAATCTCTTTGAACACTATAAATGATATTGACTATCATGATCTACTAGAGAGTCGGGATAAAGACAGGCAACAAGAAAGAAGAAATTGGCAAACAATTGAAGGTATTAAAGAATTAAAGCAAGGCTATCTAAGCCAGGCTGTACACAGAATTGCCGAATTAATGGTAGCATATAAAGCTGTCGTAGCTCTAGAAGACTTGAACATGGGTTTCAAAAGAGGGCGCCAGAAGGTCGAAAGTAGTGTTTATCAACAATTTGAAAAACAGTTAATAGATAAGTTGAATTATCTAGTGGATAAAAAAAAGCGTCCTGAGGACATTGGCGGTTTATTAAGAGCCTACCAATTCACTGCGCCATTTAAATCGTTCAAAGAAATGGGTAAACAAAACGGTTTTCTATTCTACATCCCCGCATGGAATACCTCAAATATAGATCCAACTACCGGTTTCGTCAACTTATTTCATGCTCAATATGAGAATGTGGACAAAGCAAAATCATTCTTTCAAAAATTTGATAGCATTAGCTACAATCCTAAAAAAGATTGGTTTGAATTTGCGTTCGATTATAAAAATTTCACCAAGAAGGCTGAAGGTTCCAGATCTATGTGGATATTGTGCACCCACGGAAGTAGAATTAAGAACTTCCGTAATTCACAGAAAAACGGCCAGTGGGACAGCGAAGAATTCGCCCTAACCGAAGCTTTCAAAAGTCTTTTCGTAAGATACGAGATAGACTATACAGCTGATCTAAAGACAGCTATTGTGGATGAGAAGCAAAAAGACTTCTTTGTCGACCTTCTTAAGTTGTTCAAGTTAACTGTGCAGATGAGAAATAGTTGGAAGGAAAAAGACCTAGATTACTTGATTAGCCCAGTCGCTGGTGCAGATGGCAGATTTTTTGATACACGTGAAGGCAATAAATCACTACCAAAAGACGCGGACGCTAATGGCGCATACAACATCGCATTGAAGGGTTTGTGGGCTCTCAGGCAGATTAGGCAGACAAGTGAGGGTGGTAAGCTTAAGCTGGCGATTTCTAATAAGGAATGGTTACAGTTTGTTCAAGAAAGATCCTACGAAAAAGATTAA SEQATGAACAATGGTACTAATAATTTTCAAAACTTCATAGGGATTTCTAGCCTTCAAAAGACATTGAGAAAT 
IDGCTTTAATTCCAACAGAAACGACTCAACAATTCATAGTGAAAAATGGTATTATAAAAGAAGACGAGTTGNO:CGTGGCGAGAATAGACAAATTTTGAAAGATATCATGGATGACTACTACAGAGGGTTCATCTCCGAAACA133TTGTCTTCTATTGACGACATTGACTGGACCAGCTTATTCGAAAAAATGGAAATACAGCTGAAGAACGGAGATAACAAGGACACTCTTATAAAGGAGCAAACGGAATATAGAAAGGCTATACACAAAAAGTTTGCTAATGACGATAGATTTAAAAACATGTTTAGTGCGAAGTTAATTTCTGATATTCTACCCGAGTTTGTCATTCATAATAATAACTACTCTGCATCTGAAAAAGAGGAGAAGACCCAGGTTATAAAGTTGTTTTCAAGATTTGCCACATCATTTAAAGACTACTTCAAGAACAGGGCGAATTGCTTCTCTGCTGATGATATTAGCTCTTCCAGCTGTCATAGAATTGTTAACGATAATGCCGAAATTTTTTTTAGTAATGCCTTGGTATATAGACGCATAGTCAAGTCACTAAGCAATGATGATATAAACAAGATTAGTGGTGATATGAAAGATAGCCTTAAAGAAATGAGCCTTGAAGAGATATATTCATATGAGAAGTACGGTGAATTTATAACTCAAGAAGGAATTTCTTTTTATAACGATATTTGTGGTAAGGTTAATTCTTTTATGAATTTGTATTGCCAGAAGAACAAGGAAAATAAGAATCTATATAAACTACAAAAGTTGCATAAACAGATTTTGTGTATAGCTGATACATCCTACGAAGTTCCGTATAAATTTGAATCTGATGAGGAAGTTTATCAATCGGTAAACGGTTTTCTTGACAACATTTCCAGCAAACATATCGTTGAGAGACTACGTAAAATTGGAGACAACTATAATGGTTACAATCTAGATAAAATATACATAGTGTCCAAGTTTTATGAGTCTGTCTCTCAAAAGACATATCGTGATTGGGAGACCATTAATACTGCACTTGAAATTCATTATAACAACATATTGCCTGGTAACGGGAAGAGTAAAGCTGATAAGGTTAAAAAGGCCGTCAAAAACGACTTGCAAAAGTCTATTACCGAGATAAATGAATTAGTGTCAAACTACAAACTATGCTCAGATGATAATATTAAAGCGGAAACATACATCCACGAAATTTCCCACATACTGAATAACTTTGAAGCTCAGGAGCTTAAATATAACCCGGAAATACACTTGGTTGAGAGCGAGTTAAAAGCATCTGAGTTGAAAAATGTATTAGACGTCATCATGAATGCGTTTCATTGGTGTTCAGTTTTCATGACTGAAGAATTAGTCGACAAAGATAACAATTTTTATGCCGAATTAGAGGAAATATATGATGAAATTTATCCCGTAATTAGTTTATACAATCTAGTTAGAAATTATGTTACACAAAAGCCGTATAGTACCAAGAAAATAAAGCTTAATTTCGGAATACCTACGCTTGCTGATGGTTGGTCAAAAAGTAAAGAATATAGCAATAATGCAATAATTTTAATGAGAGATAACCTATATTATTTGGGTATTTTTAACGCTAAGAACAAACCAGACAAGAAAATAATTGAAGGTAATACATCTGAAAACAAGGGCGACTATAAAAAGATGATATACAATTTGCTCCCAGGTCCTAATAAAATGATTCCTAAGGTTTTCCTGAGTAGCAAGACTGGCGTTGAAACTTACAAGCCTAGTGCGTATATCCTGGAGGGTTATAAACAGAACAAGCATATCAAATCCTCTAAGGACTTCGATATCACCTTTTGCCATGACTTAATCGATTATTTTAAAAATTGTATCGCAATTCATCCAGAATGGAAAAATTTCGGATTTGATTTTAGTGATACCAGCACTTACGAGGATATCTCTGGGTTCTACAGAGAAGTGGAGTTGCAGGGCTACAAAATCGATTGGACTTACATATCTGAA
AAGGACATAGATTTGCTGCAGGAGAAAGGTCAGCTATATTTGTTTCAAATCTACAACAAAGACTTTTCTAAAAAGTCTACCGGTAATGACAATCTGCACACAATGTACTTGAAGAACTTATTCTCCGAGGAGAACTTAAAGGACATTGTACTCAAGTTGAATGGAGAAGCCGAGATTTTTTTTAGAAAGAGCAGTATAAAGAATCCTATAATCCACAAGAAGGGCTCAATTCTCGTGAATAGGACGTATGAGGCAGAAGAAAAGGACCAATTTGGGAATATACAAATTGTAAGAAAAAACATCCCAGAAAATATCTACCAGGAATTATATAAGTATTTTAATGACAAATCTGATAAGGAACTGTCTGACGAAGCCGCTAAGCTCAAGAATGTTGTGGGCCACCATGAAGCTGCTACTAATATAGTGAAGGACTACAGATATACCTACGATAAATATTTCCTGCATATGCCAATTACTATAAACTTCAAAGCAAATAAAACAGGTTTTATAAATGATAGAATCCTGCAGTATATTGCTAAAGAAAAGGATTTACATGTAATTGGGATTGATAGAGGTGAACGCAATCTGATCTATGTCAGCGTAATAGATACTTGTGGTAATATTGTGGAACAAAAGTCCTTTAATATTGTGAACGGATATGATTACCAAATCAAGTTGAAACAACAAGAGGGAGCACGCCAAATTGCCCGTAAGGAATGGAAAGAGATAGGTAAGATCAAGGAAATTAAGGAAGGTTATCTTTCATTAGTTATTCACGAAATTTCGAAGATGGTAATCAAATACAACGCAATAATTGCTATGGAGGACCTGTCATATGGATTTAAGAAAGGTAGATTCAAGGTTGAGAGACAGGTATACCAGAAATTTGAAACTATGTTGATCAACAAATTAAATTACTTAGTCTTTAAGGACATATCAATAACGGAAAACGGCGGGCTTTTAAAAGGGTATCAACTTACATACATACCTGATAAGTTGAAAAATGTGGGTCATCAGTGTGGGTGCATCTTTTATGTTCCAGCCGCTTACACATCAAAAATCGATCCTACTACTGGGTTCGTAAACATATTTAAATTTAAAGATCTAACCGTTGATGCAAAAAGAGAGTTTATCAAGAAATTTGATAGCATTAGGTACGATTCAGAAAAAAATCTATTCTGTTTTACTTTTGACTACAACAACTTTATAACGCAGAATACAGTGATGTCAAAATCGTCCTGGTCAGTGTATACTTATGGTGTTAGAATTAAGAGACGTTTCGTAAACGGTCGTTTTTCTAACGAGTCCGATACAATCGACATCACTAAAGATATGGAAAAAACTTTGGAAATGACAGATATAAACTGGAGAGATGGTCACGACCTTAGACAAGATATAATCGATTATGAAATCGTACAGCATATTTTTGAAATTTTTCGCTTAACAGTTCAGATGCGTAACTCTCTTAGTGAGCTAGAAGATAGAGATTATGATAGACTTATCTCGCCTGTTCTTAACGAAAATAATATCTTCTATGACTCGGCAAAAGCCGGTGATGCACTTCCAAAAGATGCTGATGCAAATGGCGCGTACTGCATCGCATTGAAGGGGCTCTACGAGATTAAACAAATCACCGAAAACTGGAAAGAAGATGGTAAATTTTCTAGGGATAAGTTGAAAATCAGTAATAAAGATTGGTTCGATTTTATACAAAATAAGCGATACTTATAG SEQATGACCAATAAGTTTACTAATCAATACTCATTGTCTAAAACGTTAAGATTCGAGTTAATTCCCCAGGGAAID AGACACTAGAATTTATTCAAGAAAAAGGTCTTCTCTCTCAGGATAAACAAAGAGCAGAATCATACCAGGNO:AGATGAAAAAAACCATAGATAAATTTCATAAGTACTTCATCGACTTGGCACTATCGAACGCCAAGCTAA134 
CACATTTGGAAACCTACCTGGAGTTGTATAATAAATCGGCAGAGACGAAAAAGGAACAAAAATTCAAGGATGACCTGAAGAAGGTTCAAGATAATCTGCGAAAGGAAATAGTGAAGTCGTTTAGTGATGGTGATGCAAAGTCAATCTTTGCTATTTTAGACAAGAAGGAATTAATAACCGTGGAACTTGAAAAGTGGTTTGAAAATAACGAACAGAAAGATATTTACTTCGACGAAAAATTTAAAACGTTTACTACGTACTTTACAGGGTTCCATCAGAACCGCAAAAACATGTACTCCGTTGAACCAAACTCTACTGCAATCGCCTACAGATTAATACACGAAAATTTGCCTAAGTTTTTAGAAAATGCAAAGGCTTTTGAAAAGATAAAGCAAGTCGAATCGTTACAGGTAAACTTTCGCGAATTAATGGGCGAATTTGGAGATGAAGGTCTTATTTTTGTCAATGAATTAGAGGAAATGTTTCAAATTAATTATTATAACGATGTCTTGAGTCAGAACGGCATTACTATCTACAACTCAATTATCAGTGGTTTCACTAAGAATGATATAAAATATAAAGGTTTGAATGAATACATTAATAATTATAATCAAACTAAAGATAAGAAGGACAGGCTTCCGAAATTGAAGCAATTGTACAAGCAGATTCTAAGTGATAGGATTAGTTTGTCTTTCTTGCCAGACGCATTTACTGATGGCAAGCAAGTCTTAAAGGCTATATTCGATTTCTACAAGATTAACCTACTTTCGTACACAATTGAAGGTCAAGAAGAATCTCAAAATCTGCTGCTTTTGATTAGGCAAACTATAGAAAATTTGTCGTCCTTTGACACTCAAAAAATTTACCTGAAGAATGATACACACCTGACTACAATATCACAGCAGGTCTTTGGGGATTTTTCTGTCTTCTCCACGGCCCTAAACTATTGGTATGAGACAAAAGTTAATCCAAAATTTGAAACAGAATATAGTAAGGCGAATGAAAAAAAGAGAGAAATTTTGGATAAAGCGAAGGCAGTATTCACAAAACAAGACTATTTTTCTATCGCATTTCTCCAAGAAGTCTTATCCGAATATATTTTGACACTCGATCACACCTCTGATATAGTTAAGAAACATTCGTCCAACTGCATCGCAGATTACTTCAAGAATCACTTCGTGGCTAAGAAAGAAAACGAAACGGATAAAACTTTTGACTTCATTGCTAACATAACCGCTAAATACCAATGTATTCAGGGCATATTAGAAAATGCAGACCAGTACGAAGACGAGTTAAAACAGGACCAAAAGTTAATAGATAATCTAAAGTTTTTCTTAGATGCTATACTTGAGTTATTACATTTTATAAAGCCATTGCATCTAAAATCGGAAAGTATTACTGAAAAAGACACTGCGTTCTATGATGTGTTCGAAAATTATTATGAGGCTTTATCTTTATTGACCCCCCTTTACAACATGGTCCGCAATTATGTTACTCAGAAGCCTTACTCTACTGAAAAGATCAAATTAAACTTTGAAAATGCTCAGTTGCTGAATGGTTGGGATGCCAATAAGGAAGGTGACTACCTGACGACTATTCTAAAAAAAGACGGTAATTATTTCTTAGCAATCATGGATAAAAAACATAACAAGGCATTTCAAAAATTTCCAGAAGGAAAAGAAAACTATGAAAAGATGGTTTATAAATTGTTGCCTGGAGTTAATAAAATGTTGCCAAAAGTTTTTTTTAGCAATAAGAACATAGCTTACTTTAATCCATCTAAGGAACTGCTCGAGAACTACAAGAAGGAAACACATAAAAAAGGTGATACATTTAATTTGGAACATTGCCATACTCTGATTGATTTTTTTAAGGACTCTCTTAATAAACATGAAGACTGGAAATATTTTGATTTTCAATTTTCGGAAACTAAATCATACCAAGATCTAAGTGGATTTTACAGAGAAGTTGAACACCAAGGTTATAAGATTAACTTCAAG
AATATAGATTCTGAATACATTGATGGTCTTGTAAACGAGGGTAAACTATTCCTGTTCCAAATCTACTCTAAGGACTTCTCACCTTTTTCCAAAGGAAAACCTAATATGCATACGTTGTACTGGAAGGCTCTATTTGAAGAACAAAATTTGCAAAATGTAATCTACAAACTGAACGGCCAAGCTGAAATATTCTTCAGAAAAGCCTCAATTAAGCCAAAAAACATTATTCTTCATAAAAAGAAGATCAAGATTGCGAAGAAACATTTTATTGATAAGAAGACCAAGACTTCCGAAATTGTACCAGTACAAACAATCAAGAATCTCAATATGTATTATCAAGGCAAGATAAGTGAGAAAGAGTTAACCCAGGATGATTTACGTTATATAGACAATTTCTCTATATTCAACGAGAAGAACAAAACAATAGACATTATCAAAGATAAAAGGTTTACTGTTGACAAATTTCAATTTCATGTGCCTATCACAATGAACTTTAAGGCCACAGGTGGTTCGTACATTAATCAAACTGTTTTAGAATATCTGCAAAATAACCCAGAGGTCAAGATCATCGGTCTTGATAGGGGTGAGAGACATCTGGTGTATCTAACACTCATTGATCAACAAGGCAACATCTTGAAGCAAGAATCATTGAACACTATCACAGACTCCAAGATCTCGACTCCATATCACAAACTCCTTGACAATAAAGAAAACGAAAGGGATCTTGCCAGAAAAAATTGGGGTACAGTTGAAAATATTAAGGAACTAAAAGAAGGTTACATTTCGCAAGTAGTTCACAAGATTGCAACACTCATGTTGGAAGAAAACGCAATCGTTGTCATGGAAGATTTAAATTTCGGATTTAAGAGAGGAAGATTTAAAGTAGAAAAGCAAATCTACCAGAAGTTGGAGAAGATGTTAATTGACAAATTGAACTACTTAGTGCTGAAAGACAAACAGCCTCAAGAATTGGGCGGTCTATACAACGCTTTACAACTGACAAATAAATTTGAGTCATTCCAAAAGATGGGTAAGCAGAGTGGTTTTTTGTTTTATGTTCCGGCATGGAACACATCCAAAATCGATCCAACTACAGGCTTCGTGAATTATTTCTACACTAAATATGAAAATGTGGATAAAGCAAAAGCTTTCTTTGAGAAGTTCGAGGCGATCCGTTTTAACGCTGAAAAGAAGTACTTCGAGTTCGAGGTCAAAAAGTATTCAGATTTTAACCCCAAGGCTGAAGGCACCCAGCAAGCATGGACTATTTGCACGTACGGTGAGCGAATCGAAACTAAAAGGCAAAAGGATCAAAATAATAAGTTTGTAAGCACACCCATTAACTTGACAGAAAAGATAGAAGATTTTCTTGGAAAAAACCAAATTGTATATGGTGACGGTAACTGTATCAAGTCACAAATTGCTTCTAAAGACGATAAGGCCTTCTTCGAAACTCTGCTATACTGGTTTAAAATGACGTTGCAAATGAGAAACAGTGAAACTAGAACTGATATCGACTATTTAATATCACCCGTGATGAACGATAATGGTACCTTTTACAATTCAAGAGATTACGAGAAATTGGAGAACCCCACACTACCAAAAGACGCAGACGCTAATGGTGCCTACCATATTGCTAAAAAGGGACTGATGTTGTTGAACAAGATAGATCAAGCCGACTTAACTAAAAAAGTTGATTTGTCAATTTCGAATAGAGATTGGTTGCAATTCGTCCAGAAAAATAAGTAA SEQATGGAACAGGAATACTACTTGGGTTTGGATATGGGAACTGGTTCAGTCGGTTGGGCTGTTACGGACTCC IDGAGTACCACGTGTTGAGAAAACACGGAAAGGCTTTATGGGGTGTCAGACTATTCGAATCAGCATCGACCNO: 
GCGGAAGAGAGAAGAATGTTTAGAACTTCAAGAAGAAGGCTGGATCGTAGGAATTGGCGGATAGAAAT135TTTACAAGAAATATTCGCCGAAGAAATCTCTAAAAAAGATCCAGGATTTTTTCTACGTATGAAGGAATCCAAATACTATCCGGAAGATAAACGTGATATTAATGGCAATTGTCCAGAGTTACCCTATGCTTTATTTGTGGACGACGATTTCACCGATAAAGATTACCATAAGAAGTTCCCAACAATTTACCATCTGAGAAAGATGTTAATGAACACTGAAGAAACCCCGGATATAAGACTGGTCTATCTAGCCATTCATCATATGATGAAACACAGGGGACACTTCTTGCTATCAGGGGATATAAATGAAATTAAAGAATTTGGTACAACATTTTCTAAATTATTGGAAAATATTAAAAACGAAGAATTAGATTGGAATTTAGAATTAGGCAAGGAGGAATACGCAGTTGTCGAATCGATTCTGAAAGATAACATGTTGAACAGATCAACGAAAAAAACAAGGCTGATCAAGGCTTTAAAAGCGAAATCAATATGCGAAAAAGCAGTATTGAATTTGTTAGCTGGGGGGACTGTCAAGTTGTCTGATATTTTCGGATTGGAAGAATTGAATGAAACAGAGAGACCGAAGATATCCTTCGCCGATAATGGCTACGATGATTATATAGGCGAAGTCGAAAATGAGCTGGGCGAACAATTCTACATTATCGAGACTGCCAAGGCTGTTTATGATTGGGCGGTGTTAGTCGAAATCCTTGGCAAATACACTTCCATCTCCGAAGCTAAGGTGGCAACCTACGAAAAGCATAAAAGTGATTTGCAATTCCTTAAGAAAATTGTCCGAAAGTACTTGACCAAAGAAGAGTACAAGGATATTTTCGTATCAACATCGGACAAACTGAAGAATTATTCAGCTTATATTGGCATGACGAAAATTAATGGTAAGAAAGTTGATTTGCAATCCAAGAGATGTTCTAAAGAAGAATTTTACGATTTCATTAAAAAAAATGTCCTAAAAAAGTTGGAGGGACAACCTGAATATGAGTATTTAAAGGAAGAACTGGAAAGAGAAACTTTCCTACCAAAGCAAGTTAATCGTGATAATGGCGTTATTCCATACCAAATACACTTGTACGAATTAAAGAAGATCTTGGGTAACTTGAGGGACAAAATTGATTTAATCAAGGAAAATGAAGACAAACTGGTACAATTATTTGAATTTAGAATACCTTACTACGTGGGCCCTTTAAACAAAATAGACGATGGTAAGGAAGGGAAGTTCACATGGGCAGTCAGAAAGTCCAATGAAAAAATTTACCCATGGAATTTCGAAAACGTTGTAGATATTGAAGCTTCTGCTGAGAAATTTATTAGGAGAATGACAAATAAATGCACTTATCTTATGGGGGAAGACGTGTTGCCTAAAGATAGTTTATTATATTCAAAGTATATGGTCTTAAATGAATTAAACAATGTTAAATTAGATGGTGAAAAACTTTCCGTCGAATTGAAACAAAGATTGTATACAGATGTATTCTGCAAATATAGAAAAGTAACTGTAAAGAAGATTAAAAACTACCTTAAATGTGAAGGCATTATCAGCGGAAATGTTGAGATCACTGGTATCGATGGTGATTTTAAGGCATCTTTAACCGCATATCACGACTTTAAGGAAATATTGACGGGTACTGAGCTTGCTAAAAAAGACAAAGAGAACATTATCACCAATATCGTGCTCTTCGGAGACGACAAGAAATTATTGAAAAAGAGATTGAACCGCCTATACCCTCAGATTACCCCTAACCAATTGAAGAAAATCTGCGCTCTGTCTTATACTGGATGGGGTCGTTTTAGCAAGAAGTTTCTAGAAGAAATTACTGCTCCGGATCCTGAAACTGGGGAAGTCTGGAATATAATTACCGCGCTATGGGAATCGAATAATAATTTAATGCAATTACTATCTAATGAATA
CAGATTTATGGAAGAAGTCGAAACTTACAATATGGGAAAACAAACAAAAACTTTGAGCTACGAAACAGTAGAGAATATGTATGTCTCACCATCTGTAAAGCGGCAGATCTGGCAAACCTTGAAGATAGTTAAAGAATTAGAAAAAGTGATGAAGGAAAGTCCAAAAAGGGTTTTTATTGAAATGGCCCGAGAAAAACAAGAATCTAAAAGGACGGAAAGTAGGAAAAAGCAACTTATAGATCTATATAAAGCCTGCAAAAATGAAGAAAAAGATTGGGTAAAGGAATTAGGTGACCAGGAAGAGCAAAAATTGAGATCTGACAAGCTGTACTTGTATTATACGCAAAAGGGCCGGTGTATGTATTCGGGTGAGGTAATAGAATTGAAAGATTTATGGGATAACACTAAGTATGACATTGACCATATTTACCCCCAGTCTAAGACAATGGACGATTCATTAAATAACCGAGTTCTTGTCAAAAAGAAGTACAATGCCACAAAGAGCGATAAGTACCCATTGAACGAAAATATAAGACATGAACGAAAAGGTTTCTGGAAATCATTGTTGGACGGTGGATTTATTTCCAAAGAAAAATACGAGAGATTGATTAGAAACACTGAACTATCTCCAGAGGAGTTAGCTGGCTTTATCGAAAGACAAATTGTTGAAACTAGACAGTCTACAAAAGCAGTTGCAGAAATCTTAAAACAAGTATTTCCAGAATCCGAAATTGTGTACGTCAAAGCCGGAACAGTAAGTAGATTTAGAAAAGACTTTGAATTATTGAAAGTACGAGAGGTTAACGACCTACATCATGCTAAGGATGCTTATTTAAATATAGTCGTTGGTAATTCGTATTACGTGAAATTCACAAAAAACGCATCTTGGTTCATCAAGGAGAATCCTGGTAGGACATACAACTTGAAAAAGATGTTTACATCAGGATGGAATATCGAAAGAAATGGTGAGGTTGCGTGGGAGGTAGGCAAGAAGGGAACCATTGTTACTGTAAAGCAAATTATGAATAAAAACAATATACTTGTTACGAGACAGGTGCACGAAGCCAAAGGAGGGTTGTTTGACCAGCAAATCATGAAGAAAGGTAAAGGTCAGATAGCAATAAAAGAGACTGATGAGCGTTTAGCTAGTATAGAAAAATATGGGGGCTACAATAAGGCAGCTGGTGCTTACTTCATGTTGGTCGAATCAAAGGATAAAAAAGGGAAGACGATCCGGACCATAGAGTTTATCCCTCTGTACTTGAAGAATAAGATTGAGTCTGACGAAAGCATCGCATTGAATTTCTTGGAAAAGGGGCGCGGTCTAAAGGAGCCAAAAATATTGTTAAAGAAAATTAAAATAGACACCCTATTCGACGTCGATGGGTTTAAGATGTGGCTTAGTGGTCGTACTGGGGACAGATTATTATTCAAGTGTGCCAATCAGTTAATCCTTGACGAGAAAATCATTGTTACAATGAAAAAAATTGTTAAGTTTATTCAAAGGCGACAAGAAAATAGAGAACTAAAGTTGAGTGATAAGGATGGAATCGATAATGAAGTGTTAATGGAGATTTATAACACTTTTGTCGACAAATTGGAGAATACGGTGTACAGAATTAGGCTATCTGAACAGGCTAAAACCCTAATTGATAAACAGAAGGAGTTTGAGCGACTTTCTCTTGAAGACAAATCTTCAACTCTTTTCGAGATCCTACATATCTTTCAGTGTCAATCTTCTGCAGCTAATTTGAAAATGATTGGAGGTCCTGGTAAGGCTGGTATATTAGTCATGAACAACAACATATCTAAGTGTAATAAGATTAGTATAATTAACCAATCACCGACAGGTATCTTTGAAAATGAAATTGATTTACTTAAA SEQATGAAATCATTCGACTCGTTCACCAACTTGTACTCCCTGTCTAAAACATTGAAATTTGAAATGCGACCTGID 
TTGGTAACACCCAAAAGATGTTAGATAATGCAGGAGTTTTCGAAAAGGATAAACTGATCCAGAAAAAATNO:ACGGTAAAACGAAACCATATTTCGATAGGTTGCATCGGGAATTTATAGAAGAAGCTTTGACTGGTGTAG136AATTAATTGGCTTAGATGAGAATTTCCGTACTCTAGTCGATTGGCAAAAAGATAAAAAGAACAATGTTGCCATGAAGGCATACGAAAATAGTCTACAAAGACTAAGAACAGAGATCGGGAAAATTTTCAATTTGAAGGCAGAAGACTGGGTGAAGAACAAATATCCAATATTGGGTCTTAAGAATAAGAATACTGATATATTGTTCGAGGAGGCCGTTTTCGGTATTCTTAAGGCAAGATATGGTGAAGAGAAAGACACGTTTATTGAAGTTGAGGAGATTGATAAAACCGGTAAGTCCAAAATCAACCAGATCTCTATCTTCGACAGTTGGAAGGGCTTCACTGGTTATTTTAAGAAGTTCTTCGAAACTAGGAAGAACTTCTATAAAAACGATGGTACTTCCACGGCTATTGCTACAAGAATTATCGACCAAAACCTTAAGCGTTTTATTGATAACCTATCAATTGTTGAAAGTGTTCGACAGAAAGTAGATTTGGCTGAAACTGAAAAATCTTTTAGTATCTCCTTATCCCAGTTTTTCTCTATAGATTTTTATAATAAATGTTTGCTGCAAGATGGCATTGACTACTATAATAAAATAATTGGTGGAGAGACATTGAAAAACGGAGAGAAGCTGATTGGCCTTAATGAGTTGATAAATCAATATAGACAAAATAATAAGGACCAGAAAATCCCTTTCTTTAAATTGCTAGACAAACAGATTTTGTCTGAAAAGATCCTATTCTTGGATGAAATAAAGAACGATACTGAATTGATTGAAGCTTTGTCCCAGTTTGCTAAAACAGCTGAAGAAAAGACAAAGATTGTGAAAAAATTGTTTGCTGATTTCGTAGAAAACAATTCTAAATATGATCTAGCCCAGATTTATATAAGTCAAGAAGCTTTCAATACAATAAGTAATAAGTGGACAAGTGAAACAGAAACTTTTGCTAAGTATTTATTCGAAGCCATGAAGTCTGGTAAACTTGCCAAATACGAAAAAAAAGATAACAGTTATAAATTTCCAGACTTTATAGCCCTTTCACAGATGAAGTCTGCCTTATTGTCGATATCCTTAGAAGGTCATTTTTGGAAGGAAAAATATTATAAGATAAGCAAGTTCCAAGAAAAGACTAATTGGGAACAATTTTTGGCTATATTTCTATATGAGTTCAATTCATTATTTTCCGATAAAATCAACACTAAGGATGGAGAGACTAAGCAAGTTGGCTACTATTTGTTCGCAAAAGATCTGCACAATTTGATTCTATCAGAACAAATAGATATACCAAAAGATTCAAAGGTAACTATAAAGGATTTCGCAGATTCCGTCCTCACCATTTATCAAATGGCTAAATATTTTGCCGTTGAAAAAAAGAGAGCGTGGTTAGCAGAATACGAGTTGGACTCGTTTTATACTCAGCCAGATACTGGATACTTGCAATTCTACGATAATGCATACGAAGACATTGTACAGGTATACAATAAACTTAGAAATTACTTAACCAAGAAGCCCTACAGTGAAGAAAAATGGAAGCTGAACTTTGAAAATTCGACTTTGGCAAATGGTTGGGATAAAAATAAAGAAAGTGACAACTCCGCAGTGATTTTGCAAAAGGGTGGGAAATATTACTTGGGTTTAATCACAAAAGGCCACAATAAGATTTTTGATGATAGATTTCAAGAAAAATTCATAGTTGGTATAGAAGGTGGCAAATACGAGAAAATTGTCTATAAATTCTTCCCTGATCAAGCCAAAATGTTCCCAAAAGTTTGCTTTTCTGCTAAAGGATTGGAGTTTTTCCGGCCTAGCGAGGAGATCCTTCGTATCTACAACAATGCTGAATTCAAAAAA
GGAGAAACCTATAGCATAGATTCTATGCAAAAACTGATAGATTTTTATAAGGATTGTTTAACAAAGTACGAAGGCTGGGCCTGCTATACATTTAGACATTTAAAGCCCACAGAAGAATACCAAAATAACATTGGTGAATTCTTTCGGGACGTTGCCGAAGACGGCTATAGGATCGATTTTCAAGGTATCTCAGATCAATATATCCACGAAAAGAACGAGAAGGGTGAGCTGCACCTTTTCGAAATTCATAATAAGGACTGGAATTTGGATAAGGCGAGAGATGGTAAATCGAAGACCACTCAAAAGAACTTGCATACTTTATATTTTGAGTCCTTGTTTTCTAATGATAACGTCGTCCAAAATTTTCCAATAAAGTTGAATGGACAAGCGGAAATTTTCTATCGGCCTAAGACAGAGAAAGACAAATTAGAATCAAAGAAAGATAAAAAGGGAAATAAAGTCATTGATCACAAACGATACTCTGAGAATAAAATATTTTTCCACGTACCATTGACACTCAACAGGACTAAGAATGACTCTTATAGATTTAATGCTCAGATTAATAATTTTTTGGCAAATAACAAGGATATTAACATAATTGGGGTGGATAGAGGTGAAAAGCACTTGGTATATTACTCTGTCATCACTCAGGCTTCTGATATATTGGAAAGCGGGTCTCTAAATGAATTGAACGGTGTTAACTACGCCGAAAAGCTAGGTAAAAAAGCTGAAAACAGAGAGCAGGCTCGGCGCGATTGGCAAGATGTTCAAGGAATTAAAGACCTTAAAAAAGGCTACATTAGTCAAGTAGTTAGAAAGTTAGCCGATCTTGCTATTAAACATAACGCAATCATTATTCTGGAGGACCTAAATATGCGTTTTAAGCAAGTTAGGGGTGGCATAGAAAAAAGTATTTATCAGCAGCTTGAGAAGGCTTTGATAGATAAGTTATCGTTCCTAGTTGACAAAGGTGAAAAAAATCCTGAACAAGCTGGTCATCTGTTGAAAGCTTATCAGCTGAGCGCACCTTTTGAAACATTTCAAAAAATGGGAAAACAAACAGGTATTATTTTCTATACTCAAGCGAGTTATACAAGTAAATCTGACCCAGTGACAGGATGGAGACCACACCTTTATCTAAAATATTTTTCTGCTAAAAAGGCCAAAGATGACATCGCTAAGTTTACAAAAATAGAATTTGTCAACGATAGATTTGAATTGACTTACGATATTAAAGATTTTCAGCAAGCAAAAGAATACCCAAATAAGACAGTGTGGAAAGTATGCTCCAATGTGGAGAGATTTAGATGGGATAAAAATCTCAATCAAAACAAGGGTGGTTACACACATTATACTAATATAACTGAAAATATTCAAGAATTGTTTACTAAGTACGGAATTGACATAACCAAAGACTTACTAACTCAGATTTCAACTATTGACGAAAAACAAAATACCTCATTTTTCCGCGACTTTATTTTTTATTTCAACTTGATCTGTCAAATTCGTAACACGGATGATTCCGAAATTGCCAAGAAGAACGGAAAAGATGATTTCATCCTATCTCCAGTGGAACCATTTTTTGACTCAAGAAAAGATAATGGTAATAAGTTGCCTGAGAACGGAGATGATAACGGCGCTTATAATATCGCTCGGAAGGGTATTGTAATTCTTAATAAAATATCTCAGTACTCTGAAAAGAACGAAAACTGCGAGAAAATGAAGTGGGGCGACTTGTATGTATCTAATATAGATTGGGATAATTTCGTTACTCAAGCCAACGCGAGACATTGA SEQATGGAAAATTTTAAAAACCTATATCCAATTAATAAGACACTTAGATTCGAGCTTAGGCCATACGGCAAA 
IDACACTAGAAAATTTTAAGAAGTCAGGCCTATTAGAAAAAGACGCCTTTAAGGCAAATTCCAGAAGATCANO:ATGCAGGCAATTATTGATGAGAAATTTAAAGAGACTATCGAGGAAAGGTTGAAATACACTGAATTCTCT137GAGTGCGATCTGGGAAACATGACTTCCAAGGATAAAAAGATTACCGATAAGGCTGCTACCAACCTCAAAAAGCAAGTCATCTTATCGTTTGATGATGAAATTTTTAATAACTACTTAAAGCCGGACAAAAACATTGACGCCCTATTCAAAAATGATCCGTCCAACCCCGTAATTTCAACTTTTAAGGGTTTTACCACGTACTTTGTAAATTTTTTTGAGATTCGTAAACATATCTTCAAAGGAGAATCGTCGGGTTCCATGGCCTATAGGATAATTGATGAAAATCTTACGACTTACTTAAACAATATCGAAAAGATAAAAAAGTTACCAGAAGAATTAAAGTCTCAATTGGAAGGTATTGACCAAATAGACAAATTAAATAACTATAATGAGTTCATAACTCAAAGCGGTATCACACATTACAATGAAATTATCGGTGGTATATCTAAAAGTGAGAACGTAAAAATACAGGGAATAAACGAGGGGATCAATCTATACTGTCAGAAGAATAAAGTAAAATTACCAAGACTAACGCCATTATACAAAATGATTCTGTCTGATAGAGTTTCCAACTCGTTCGTGCTTGATACTATAGAAAATGATACTGAATTAATTGAGATGATTAGCGACTTGATTAATAAAACAGAAATATCTCAAGACGTAATAATGTCAGACATTCAGAACATTTTCATAAAATATAAACAGCTTGGTAATTTACCGGGGATAAGTTACTCTAGCATCGTGAATGCTATTTGCTCCGATTATGACAATAATTTTGGTGACGGAAAAAGAAAAAAATCATATGAGAACGATAGGAAGAAACACCTTGAAACAAACGTATACTCAATTAACTATATATCGGAACTGTTAACAGACACCGATGTATCATCTAATATAAAAATGAGATATAAGGAACTTGAACAAAATTACCAGGTGTGTAAGGAGAATTTCAATGCTACCAACTGGATGAACATTAAGAATATTAAACAGAGTGAAAAGACAAACTTGATTAAAGATCTACTAGATATACTGAAATCAATACAGAGATTCTACGATCTGTTTGATATAGTTGATGAAGACAAAAATCCTAGTGCTGAGTTTTACACGTGGCTAAGTAAAAATGCGGAAAAGTTAGATTTCGAGTTCAACTCTGTTTATAATAAATCTAGGAATTATTTAACTAGAAAGCAGTATTCTGATAAAAAGATAAAATTGAACTTCGACTCCCCTACGTTGGCAAAGGGTTGGGATGCAAACAAAGAAATCGATAACTCCACCATAATAATGCGTAAGTTTAACAATGATAGGGGGGATTACGATTATTTTTTGGGAATTTGGAACAAATCTACCCCAGCGAATGAAAAAATTATTCCCCTTGAAGACAATGGTCTTTTTGAAAAAATGCAGTATAAATTATATCCAGACCCATCCAAGATGCTTCCAAAGCAATTTCTGTCAAAAATTTGGAAGGCTAAACACCCTACTACTCCTGAATTTGATAAGAAGTATAAGGAGGGCCGACACAAAAAGGGTCCAGATTTTGAAAAAGAATTCCTGCATGAATTGATAGATTGTTTTAAGCATGGTTTGGTAAATCATGATGAAAAATATCAGGATGTCTTTGGATTCAATTTGAGAAATACAGAGGATTACAACTCATATACAGAATTTCTCGAGGACGTCGAACGTTGCAATTATAATCTCAGTTTCAACAAGATCGCAGACACTTCAAACTTAATTAACGACGGAAAATTGTACGTTTTTCAAATCTGGTCGAAAGACTTTAGTATTGATTCAAAGGGTACAAAAAACCTAAATACAATATATTTCGAAAGTCTATTCTCGGAAGAAAAC
ATGATCGAAAAAATGTTCAAACTGTCAGGCGAAGCTGAAATATTCTACCGTCCCGCAAGCCTTAATTATTGTGAGGATATCATTAAAAAAGGACATCACCATGCAGAGTTAAAAGATAAATTCGATTACCCAATAATTAAAGATAAAAGATACTCCCAGGATAAGTTCTTTTTCCATGTACCTATGGTTATTAACTACAAGTCGGAAAAACTAAACTCGAAGTCATTAAATAATAGAACTAACGAGAACTTGGGACAATTCACACATATAATTGGTATTGATCGTGGCGAAAGACATTTAATATATCTGACTGTTGTTGATGTTTCAACAGGAGAAATTGTTGAACAGAAACATCTTGATGAAATTATAAACACAGATACAAAAGGCGTTGAGCATAAAACTCATTATCTAAATAAATTGGAGGAAAAGTCGAAGACTCGCGATAACGAGAGAAAGAGTTGGGAAGCAATTGAAACCATAAAAGAGCTTAAAGAAGGTTACATTAGTCACGTCATCAATGAAATACAAAAGTTACAAGAAAAGTATAACGCTTTGATTGTAATGGAAAATCTAAATTATGGTTTTAAGAATTCAAGAATCAAAGTCGAAAAGCAGGTCTATCAGAAATTTGAAACGGCACTTATTAAAAAGTTTAACTACATTATTGATAAAAAGGACCCAGAAACTTATATTCATGGTTACCAACTGACGAACCCAATCACAACATTGGACAAAATTGGAAACCAAAGTGGAATTGTTTTATACATTCCAGCTTGGAATACATCCAAAATAGACCCTGTCACGGGGTTTGTCAACTTGTTATATGCCGACGATTTAAAGTATAAAAACCAAGAACAAGCAAAGTCTTTTATTCAAAAGATTGATAATATTTATTTCGAAAACGGTGAATTTAAATTCGACATAGATTTTTCTAAATGGAACAACCGTTATTCAATAAGTAAAACTAAATGGACACTCACCTCATACGGCACTCGTATCCAAACCTTTCGGAATCCCCAAAAAAATAACAAATGGGATTCTGCAGAATACGACTTGACCGAGGAATTTAAATTAATTCTTAATATAGACGGTACACTCAAAAGTCAAGACGTGGAGACATACAAGAAGTTTATGTCGTTATTCAAGCTTATGCTTCAGTTGAGGAACTCCGTTACAGGCACTGATATTGATTACATGATTTCACCAGTAACGGATAAGACTGGGACTCATTTCGATTCTAGGGAAAATATTAAAAATTTACCTGCTGACGCAGACGCAAACGGCGCATACAATATAGCAAGAAAAGGGATTATGGCCATTGAGAATATTATGAATGGCATATCAGATCCATTAAAGATAAGCAATGAAGACTACTTAAAATACATTCAGAATCAGCAAGAATAA
SEQ ID NO: 138
ATGACCCAGTTTGAAGGTTTCACCAATTTGTACCAAGTAAGTAAAACCTTGAGGTTCGAATTGATCCCACAGGGCAAGACATTGAAGCATATTCAAGAGCAAGGATTTATAGAAGAAGATAAAGCGAGAAACGATCACTATAAAGAGTTAAAACCCATTATTGACAGGATCTATAAAACATACGCCGATCAATGCCTTCAATTAGTG
CAATTAGATTGGGAAAACTTGAGCGCTGCCATCGATTCCTACAGGAAGGAAAAAACAGAAGAAACAAGAAATGCCTTAATCGAGGAACAAGCAACCTATAGAAACGCTATACACGATTACTTCATCGGTAGAACTGATAATCTAACAGATGCAATAAATAAGAGACATGCTGAGATATATAAAGGACTATTTAAAGCAGAATTATTCAACGGAAAGGTGTTGAAACAGTTAGGTACCGTTACAACTACTGAGCATGAAAATGCCTTGCTGAGAAGCTTTGACAAGTTTACTACCTACTTTTCGGGTTTCTACGAAAATCGCAAAAATGTATTTTCTGCGGAAGATATTTCAACTGCAATCCCTCATAGGATTGTTCAAGATAATTTCCCTAAGTTTAAAGAGAACTGTCACATTTTTACAAGGTTAATTACTGCGGTTCCAAGTCTAAGAGAACATTTTGAGAATGTAAAAAAAGCGATTGGTATATTTGTATCCACTAGCATTGAAGAGGTTTTCAGCTTCCCTTTTTATAACCAATTACTTACCCAAACACAGATCGACCTGTACAACCAATTGTTAGGTGGTATATCGAGGGAGGCTGGTACGGAAAAGATTAAAGGATTAAATGAAGTTCTTAATTTGGCCATACAAAAAAATGATGAAACCGCGCACATTATCGCATCTTTACCACATAGGTTTATACCGTTATTCAAGCAAATATTATCTGATCGTAATACCTTATCGTTCATATTAGAGGAGTTTAAATCTGACGAAGAAGTTATACAATCTTTTTGCAAGTATAAGACGCTATTGAGAAACGAAAACGTTCTGGAAACAGCCGAAGCACTGTTCAATGAATTAAACAGTATCGACTTGACTCATATTTTTATATCGCATAAAAAGTTGGAGACAATTTCTTCAGCATTGTGCGATCACTGGGACACTTTAAGGAACGCACTATATGAACGTAGGATCTCAGAATTGACAGGTAAGATAACGAAGTCTGCTAAAGAGAAAGTGCAGAGATCCCTAAAACACGAGGATATAAATTTGCAGGAGATAATTTCAGCTGCAGGTAAAGAGTTGTCTGAAGCGTTCAAGCAAAAGACTTCCGAAATCTTGTCACACGCACACGCCGCATTAGATCAACCTTTACCCACTACTTTGAAAAAACAAGAAGAGAAGGAGATATTAAAATCACAACTTGATTCTTTACTTGGCCTTTATCATCTTTTAGATTGGTTCGCTGTTGACGAGAGCAATGAAGTGGATCCAGAGTTTTCCGCAAGATTGACCGGTATAAAGTTGGAAATGGAACCTTCGTTATCATTTTACAACAAAGCTAGGAACTATGCTACAAAAAAACCTTATTCTGTCGAAAAATTTAAACTGAACTTCCAAATGCCTACTCTAGCAAGTGGCTGGGATGTTAATAAAGAAAAGAACAATGGCGCTATTTTGTTTGTAAAAAATGGCCTATACTATCTTGGAATTATGCCTAAACAAAAAGGTCGCTACAAGGCTTTGTCATTTGAACCTACTGAAAAGACTAGCGAAGGTTTCGATAAGATGTATTACGATTATTTCCCGGATGCCGCTAAAATGATCCCCAAGTGCTCTACTCAATTGAAGGCAGTAACTGCTCATTTCCAAACGCATACCACGCCAATACTGCTTTCTAACAACTTTATAGAACCACTAGAAATAACGAAAGAAATTTACGACCTAAATAACCCAGAGAAAGAACCAAAAAAGTTCCAGACGGCCTACGCCAAAAAGACAGGGGACCAAAAAGGTTACCGCGAGGCGTTATGTAAATGGATTGATTTTACTAGGGACTTTTTATCAAAATACACTAAAACGACGTCTATTGATCTTAGCTCCTTACGCCCGTCCTCCCAATACAAGGATCTAGGTGAGTATTACGCAGAGTTGAACCCGCTATTATACCATATTTCCTTCCAAAGGATTGCTGAAAAGGAAATTATGGACGCTGTTGA
AACTGGGAAATTGTACCTGTTTCAGATTTATAATAAGGACTTCGCAAAGGGTCACCATGGTAAGCCTAACCTTCACACTTTGTACTGGACCGGACTATTCTCGCCTGAAAATTTGGCTAAAACAAGTATCAAGTTAAACGGTCAGGCCGAGTTATTTTATAGACCCAAATCTAGAATGAAAAGAATGGCCCATAGATTAGGCGAAAAGATGTTAAACAAGAAATTAAAGGACCAAAAAACCCCGATACCAGACACTCTATACCAAGAACTGTACGACTATGTGAATCACAGGCTTAGTCACGATTTATCAGATGAAGCGAGGGCTTTATTGCCAAATGTCATCACCAAGGAAGTATCACATGAAATAATTAAGGATAGAAGGTTCACATCTGATAAATTCTTTTTTCATGTCCCAATTACATTGAATTATCAAGCAGCGAACTCACCATCTAAATTTAATCAGCGCGTCAACGCCTATTTGAAAGAACATCCCGAAACACCAATCATCGGCATAGATCGAGGTGAGAGAAACTTAATATATATAACTGTGATTGATTCTACAGGAAAAATCCTGGAGCAACGATCTTTAAATACCATACAACAGTTTGATTATCAAAAAAAGTTGGATAACAGAGAAAAAGAACGTGTTGCCGCTAGGCAGGCTTGGTCTGTGGTAGGAACAATTAAGGACTTAAAGCAGGGCTATCTGTCCCAAGTTATTCATGAAATAGTCGATCTGATGATACATTATCAGGCAGTTGTCGTGTTGGAAAATTTGAATTTTGGCTTTAAATCAAAAAGAACTGGCATAGCAGAAAAAGCTGTGTACCAGCAGTTTGAAAAGATGTTAATCGATAAGCTAAACTGCCTTGTTCTTAAAGATTACCCCGCAGAAAAAGTAGGTGGTGTTCTTAATCCATATCAGTTGACAGACCAATTTACATCCTTTGCGAAAATGGGTACGCAAAGCGGGTTCTTATTCTACGTACCGGCCCCCTATACTTCTAAGATCGACCCACTAACAGGTTTTGTGGACCCTTTTGTTTGGAAGACGATAAAGAACCACGAGTCACGCAAACATTTCTTAGAGGGCTTTGATTTCTTGCACTACGACGTGAAAACTGGTGATTTTATCTTACACTTTAAAATGAACAGAAATCTCTCTTTCCAACGTGGACTGCCCGGATTCATGCCGGCTTGGGACATCGTTTTTGAAAAGAATGAAACGCAGTTTGACGCCAAAGGTACACCATTTATAGCGGGTAAGAGAATTGTGCCGGTCATAGAAAACCATAGATTTACAGGTAGATATAGGGATCTGTACCCTGCTAATGAATTGATTGCATTACTCGAAGAGAAAGGAATTGTGTTTCGAGATGGATCGAATATTTTACCTAAGTTGTTGGAAAATGATGATTCACACGCAATTGATACTATGGTTGCCCTCATAAGATCGGTATTGCAAATGAGAAACTCAAATGCTGCTACGGGAGAGGATTATATAAACAGCCCCGTTCGCGATCTTAATGGTGTTTGTTTTGATTCACGTTTTCAGAACCCCGAATGGCCAATGGATGCCGACGCAAACGGAGCATATCATATTGCTCTTAAAGGCCAACTACTATTAAATCACTTAAAGGAATCCAAAGACCTAAAATTGCAAAACGGGATATCTAATCAGGATTGGCTGGCTTACATACAAGAACTACGTAACTAG SEQATGGCCGTTAAGTCAATCAAAGTGAAACTTAGACTGGATGACATGCCAGAGATTCGTGCGGGGTTATGG IDAAACTTCATAAGGAAGTTAACGCAGGGGTAAGATATTATACCGAATGGTTATCATTACTTCGACAAGAGNO: 
AATTTGTACAGAAGGTCCCCGAACGGCGACGGTGAGCAAGAATGCGATAAGACGGCTGAAGAATGTAA139GGCAGAACTTTTGGAGCGCCTGAGAGCCCGTCAGGTTGAAAATGGCCATAGAGGTCCTGCGGGATCTGATGATGAGCTTTTACAGCTAGCTAGACAATTGTATGAATTGTTGGTCCCTCAGGCTATTGGGGCTAAAGGAGACGCTCAACAAATCGCCAGAAAGTTCTTGTCACCTCTGGCTGACAAAGATGCCGTGGGAGGATTAGGTATCGCTAAAGCAGGTAATAAACCAAGATGGGTTAGAATGAGAGAAGCAGGCGAACCTGGTTGGGAAGAAGAGAAAGAAAAGGCCGAAACTAGAAAAAGCGCTGACAGAACCGCAGATGTTTTACGGGCCTTGGCTGATTTTGGACTGAAGCCTTTGATGAGAGTGTATACTGATTCAGAAATGTCTTCCGTTGAATGGAAGCCCCTAAGGAAGGGACAAGCGGTCAGAACCTGGGATAGGGATATGTTTCAACAGGCTATTGAAAGGATGATGTCATGGGAATCCTGGAATCAAAGAGTAGGTCAAGAATACGCTAAACTGGTCGAACAAAAGAATAGATTTGAACAAAAAAATTTTGTAGGTCAAGAACATTTAGTACATTTGGTTAATCAACTTCAACAAGATATGAAAGAGGCATCTCCTGGTTTGGAATCAAAAGAACAAACAGCACACTATGTTACCGGCCGAGCTTTGCGAGGTTCTGACAAAGTATTTGAAAAGTGGGGGAAATTAGCTCCCGATGCCCCCTTTGATCTATATGATGCTGAAATTAAAAACGTTCAAAGAAGGAACACTAGACGTTTTGGATCCCATGATCTTTTTGCAAAGCTAGCTGAGCCAGAATACCAGGCTCTATGGCGTGAAGACGCCTCGTTTTTGACTAGATACGCAGTATACAATTCAATACTCAGAAAACTAAACCATGCCAAGATGTTTGCTACATTCACCCTGCCCGATGCTACCGCTCATCCTATTTGGACTAGATTTGACAAGTTGGGGGGGAATCTACATCAGTACACATTTTTATTTAATGAATTCGGTGAAAGAAGACACGCTATTAGATTCCACAAGCTCCTAAAGGTTGAAAACGGCGTTGCGAGAGAAGTTGATGATGTAACAGTTCCCATTTCTATGTCGGAGCAATTGGATAATCTATTGCCTAGAGACCCTAATGAACCAATTGCTTTGTACTTTCGTGACTACGGTGCAGAACAACACTTTACAGGTGAATTCGGCGGAGCCAAGATTCAATGTAGACGTGATCAACTCGCACACATGCATAGAAGAAGAGGCGCTCGTGATGTTTATTTAAATGTGTCTGTTAGAGTTCAATCCCAATCGGAGGCTAGAGGTGAAAGAAGGCCACCATACGCAGCAGTTTTTAGGTTAGTAGGTGATAATCATAGGGCATTTGTCCACTTCGACAAATTAAGTGATTATTTAGCAGAGCACCCTGATGATGGAAAGTTGGGCAGTGAGGGATTATTAAGTGGGTTGAGGGTAATGTCTGTAGATCTTGGTCTTCGTACTTCTGCGAGTATCTCTGTCTTTAGAGTAGCACGTAAGGATGAGTTGAAACCTAATAGCAAAGGAAGAGTCCCGTTTTTTTTTCCTATTAAGGGTAACGATAACCTGGTGGCCGTGCATGAAAGATCACAACTTTTGAAATTGCCAGGAGAAACGGAGTCCAAGGACTTGAGGGCAATTAGAGAGGAACGTCAGCGTACATTGCGACAGCTGAGAACTCAATTGGCTTATTTGAGGTTGTTGGTTAGGTGTGGTTCCGAGGATGTTGGCAGAAGAGAAAGGTCTTGGGCCAAATTGATAGAACAACCAGTGGACGCCGCAAATCACATGACACCAGATTGGAGAGAAGCTTTCGAAAATGAACTCCAGAAATTAAAGAGCCTACATGGCATATGCTCTGATAAAGAGTGGAT
GGATGCCGTATACGAATCCGTTCGTAGAGTCTGGCGCCACATGGGTAAGCAAGTACGGGACTGGAGAAAGGATGTTCGTTCCGGCGAAAGACCGAAGATAAGGGGGTATGCAAAGGACGTTGTAGGCGGTAATTCTATTGAACAGATTGAGTATTTGGAAAGGCAGTACAAATTTCTTAAATCCTGGAGCTTCTTCGGCAAAGTGTCAGGACAAGTCATCAGGGCTGAAAAAGGTTCCAGATTTGCTATTACGCTAAGGGAACATATTGATCATGCGAAAGAAGATAGACTGAAAAAACTAGCAGATAGAATAATTATGGAAGCACTTGGTTACGTCTATGCACTTGATGAAAGAGGCAAGGGGAAATGGGTAGCTAAATACCCGCCTTGTCAACTTATTTTATTAGAAGAATTAAGCGAGTACCAATTTAACAACGATAGACCTCCATCCGAAAATAATCAGCTGATGCAATGGTCCCATAGGGGTGTTTTTCAAGAATTGATAAATCAAGCTCAAGTACACGATTTGCTGGTAGGTACTATGTACGCAGCGTTTTCGAGCCGTTTTGATGCAAGAACTGGTGCCCCAGGTATCAGATGTCGACGTGTTCCGGCCAGATGTACACAGGAACATAACCCTGAGCCATTTCCGTGGTGGCTTAATAAGTTTGTTGTCGAGCACACATTAGACGCATGCCCTCTGAGAGCAGATGACCTTATACCCACTGGAGAAGGCGAAATATTTGTTAGTCCATTCTCTGCAGAAGAAGGTGACTTTCACCAGATACATGCAGACTTAAATGCAGCACAGAATCTCCAACAAAGGTTGTGGTCGGATTTTGATATTTCGCAAATAAGACTAAGATGCGATTGGGGAGAGGTTGATGGAGAATTGGTGCTGATTCCAAGATTAACCGGAAAGCGAACTGCCGATTCCTATTCTAACAAGGTGTTTTACACAAATACTGGTGTTACCTATTACGAAAGAGAAAGGGGTAAGAAGAGACGTAAAGTATTTGCTCAAGAAAAATTGTCAGAAGAGGAGGCAGAACTGTTAGTAGAAGCAGACGAAGCCAGAGAAAAATCAGTTGTGCTTATGCGTGACCCTTCCGGCATTATAAATCGTGGTAATTGGACACGACAAAAAGAATTTTGGTCTATGGTCAATCAACGTATCGAAGGCTACCTAGTTAAGCAAATCAGGTCTAGGGTTCCACTACAAGATAGCGCATGTGAAAATACGGGTGATATATAA SEQATGGCTACTAGATCTTTCATTTTAAAAATTGAACCTAATGAAGAAGTGAAGAAGGGTCTCTGGAAAACT IDCACGAAGTACTTAATCATGGCATTGCCTATTATATGAATATCCTGAAGCTTATTCGTCAAGAAGCTATATNO: 
ACGAGCATCATGAGCAAGATCCTAAGAACCCTAAGAAAGTAAGCAAAGCGGAAATTCAGGCTGAATTG140TGGGACTTCGTCTTGAAGATGCAGAAGTGTAACAGTTTTACGCACGAAGTTGATAAAGATGTGGTGTTTAATATTTTGAGGGAGCTATATGAGGAGTTGGTGCCCTCGAGTGTCGAAAAAAAAGGAGAAGCTAATCAGCTGTCAAATAAATTTTTATATCCTCTGGTGGATCCAAACTCTCAATCAGGTAAAGGCACTGCCAGTAGTGGTCGAAAACCGAGATGGTATAATTTGAAAATCGCAGGTGATCCATCGTGGGAAGAAGAAAAAAAAAAATGGGAAGAAGATAAAAAAAAAGATCCCCTTGCCAAAATACTAGGTAAGCTAGCCGAGTATGGACTTATACCATTATTCATTCCTTTCACGGACTCTAATGAACCAATTGTGAAGGAAATCAAATGGATGGAAAAATCACGTAATCAGTCTGTTAGGAGGTTGGACAAAGATATGTTTATACAGGCTCTTGAGAGGTTTTTGTCGTGGGAGTCCTGGAATTTGAAAGTGAAAGAAGAATATGAAAAAGTGGAAAAGGAGCATAAGACGTTGGAAGAAAGGATTAAGGAAGATATTCAGGCCTTTAAGAGTCTGGAACAGTACGAAAAAGAAAGACAGGAACAGTTATTGAGAGATACTCTAAACACTAATGAATATAGGCTTTCCAAGAGGGGCTTGCGAGGATGGAGAGAGATAATTCAGAAATGGTTGAAAATGGATGAGAACGAGCCATCGGAGAAATATCTAGAGGTGTTTAAAGATTACCAAAGAAAGCACCCTCGCGAAGCTGGTGATTACTCTGTTTATGAATTCCTTTCGAAGAAGGAAAATCACTTCATCTGGCGAAATCATCCAGAGTACCCATATTTATATGCTACATTTTGCGAAATTGACAAGAAAAAAAAAGATGCTAAACAGCAAGCGACATTCACCCTCGCTGATCCCATCAACCACCCATTATGGGTCAGGTTCGAAGAGAGATCAGGCTCGAACCTGAATAAGTACAGGATCTTGACTGAGCAATTGCATACTGAGAAGTTAAAAAAGAAATTGACGGTCCAACTTGACAGATTGATTTATCCCACTGAATCTGGTGGATGGGAGGAGAAAGGTAAGGTTGATATTGTCCTATTGCCTTCTCGTCAATTTTACAACCAAATATTTCTGGACATCGAAGAGAAGGGTAAACATGCTTTTACCTATAAGGATGAGAGTATTAAATTTCCATTGAAGGGAACGCTTGGCGGCGCTAGAGTTCAGTTCGATAGAGATCATTTGAGAAGATACCCGCATAAAGTGGAATCTGGTAATGTAGGTCGGATCTACTTTAACATGACGGTAAATATTGAACCTACCGAGTCACCAGTCAGTAAGTCTTTAAAGATTCATAGGGATGATTTCCCTAAATTTGTCAACTTCAAGCCTAAGGAACTAACCGAGTGGATCAAAGACAGTAAAGGCAAAAAGTTAAAGAGCGGTATTGAGTCCCTGGAGATAGGTCTTAGAGTCATGTCTATCGATTTGGGTCAAAGACAAGCAGCCGCAGCATCTATTTTCGAAGTTGTTGACCAAAAACCGGATATCGAGGGGAAATTATTTTTTCCAATAAAAGGAACTGAGCTATACGCTGTGCATCGCGCATCCTTCAATATAAAACTGCCAGGAGAAACACTAGTAAAATCTAGAGAGGTCTTGCGTAAAGCACGTGAGGACAATCTCAAATTAATGAATCAGAAGTTAAATTTCCTTAGGAACGTGTTGCATTTCCAACAGTTCGAGGACATAACTGAACGCGAGAAAAGAGTCACTAAGTGGATCTCAAGACAAGAAAATAGTGATGTGCCATTAGTGTATCAAGACGAACTTATTCAAATAAGAGAGCTAATGTATAAACCATATAAAGACTGGGTGGCATTCTTAAAACAATTACACAAG
CGGCTTGAAGTAGAAATAGGAAAAGAAGTAAAGCATTGGAGGAAGAGTCTGTCCGATGGTCGCAAAGGCCTGTACGGGATATCACTTAAAAATATTGATGAAATTGACAGAACACGAAAATTTTTGTTAAGATGGTCATTGAGACCAACCGAACCAGGTGAGGTTAGAAGGTTGGAACCAGGCCAAAGGTTTGCCATCGATCAATTAAACCATCTTAACGCACTGAAAGAAGATAGATTGAAGAAGATGGCGAACACTATTATTATGCACGCTCTAGGTTATTGCTATGATGTGAGAAAGAAAAAATGGCAAGCCAAGAACCCTGCATGCCAAATTATTTTGTTTGAAGATCTTTCTAATTACAATCCATACGAAGAGCGTTCACGTTTTGAAAACTCTAAATTGATGAAATGGTCTAGAAGAGAGATTCCGAGACAGGTCGCTCTACAAGGGGAGATTTACGGTCTTCAAGTCGGTGAGGTTGGTGCTCAATTTTCTTCCAGATTTCATGCAAAAACTGGGTCTCCAGGCATTAGGTGTTCGGTCGTTACTAAGGAAAAGTTACAGGACAACCGTTTCTTCAAAAATTTGCAACGTGAAGGCCGTTTAACACTTGATAAGATAGCTGTCCTTAAGGAAGGCGATCTGTACCCAGATAAAGGTGGTGAGAAATTCATATCTTTGAGTAAAGACAGGAAACTGGTTACAACACACGCCGACATTAACGCAGCTCAGAACTTGCAAAAGAGATTCTGGACAAGGACCCACGGCTTCTATAAGGTGTACTGTAAAGCTTATCAAGTAGATGGACAAACGGTTTATATTCCTGAATCAAAGGACCAGAAACAAAAAATTATAGAAGAATTTGGTGAAGGATACTTTATCTTGAAGGATGGAGTTTATGAGTGGGGCAATGCAGGTAAGTTAAAGATAAAGAAAGGTTCATCAAAGCAATCAAGTAGCGAACTGGTCGATTCGGATATTTTAAAGGATAGCTTTGATCTAGCTAGTGAATTGAAGGGAGAAAAGTTAATGTTATACAGAGATCCCAGTGGGAATGTATTTCCATCTGATAAGTGGATGGCCGCCGGAGTGTTTTTTGGCAAATTAGAGAGAATCTTGATTTCTAAACTGACCAATCAATACTCAATTTCGACCATCGAAGACGACTCTTCAAAACAATCCATGTGA SEQATGCCTACTCGCACCATCAATCTGAAGTTAGTTTTGGGGAAGAACCCAGAAAATGCGACTCTAAGACGG 
IDGCACTATTCTCTACACATAGACTTGTCAACCAAGCGACTAAGAGAATTGAAGAATTTTTACTGTTGTGTANO:GAGGAGAAGCTTATCGTACCGTAGATAATGAAGGTAAAGAAGCTGAGATCCCACGCCATGCTGTTCAAG141AAGAGGCGCTTGCTTTTGCAAAAGCTGCACAACGACATAACGGCTGTATCTCCACATATGAGGACCAGGAAATCTTGGATGTGCTTAGACAATTGTATGAAAGATTAGTACCTAGCGTCAATGAAAACAACGAGGCTGGGGATGCCCAAGCCGCTAACGCTTGGGTGAGTCCATTAATGAGTGCAGAGTCCGAAGGTGGACTATCGGTCTATGATAAAGTGTTAGACCCGCCGCCAGTATGGATGAAACTCAAAGAAGAGAAAGCGCCTGGTTGGGAAGCTGCTTCTCAGATTTGGATACAGTCCGACGAAGGTCAATCGCTGCTAAATAAACCGGGTAGCCCACCACGTTGGATTAGAAAACTTAGATCTGGTCAACCGTGGCAAGATGACTTCGTTTCAGACCAAAAAAAAAAGCAAGATGAACTAACGAAAGGTAACGCACCACTCATAAAACAATTGAAAGAGATGGGCCTCTTGCCTTTAGTTAATCCCTTTTTTAGACATTTGTTGGATCCCGAGGGTAAGGGTGTATCCCCATGGGACAGATTGGCCGTAAGGGCCGCGGTGGCGCACTTCATCTCTTGGGAAAGTTGGAACCACAGAACAAGAGCTGAGTATAACAGTTTGAAACTGCGAAGAGATGAATTTGAGGCCGCATCTGATGAATTCAAGGACGATTTTACATTGCTACGACAATATGAGGCTAAGCGACATAGTACGCTTAAGTCAATTGCCTTAGCTGATGACTCTAACCCGTACCGAATTGGTGTAAGGTCCTTGAGAGCCTGGAATAGGGTTAGAGAAGAATGGATTGACAAAGGCGCAACCGAGGAACAAAGGGTTACCATCCTTAGTAAGCTTCAAACACAATTACGGGGTAAATTCGGTGATCCAGACCTATTTAATTGGCTAGCCCAAGATAGACACGTACACCTGTGGTCCCCGAGAGATTCCGTCACGCCCCTCGTAAGGATTAATGCCGTCGACAAAGTGCTTAGAAGACGTAAGCCTTATGCACTGATGACTTTTGCACATCCGAGATTCCATCCAAGATGGATTCTATACGAAGCGCCTGGTGGTTCTAACTTGCGACAATACGCTTTAGATTGTACTGAAAATGCTCTGCATATTACACTTCCATTACTCGTCGACGACGCCCATGGTACATGGATTGAGAAAAAAATCCGCGTACCACTCGCTCCTAGTGGACAAATACAAGATTTAACTTTAGAAAAACTTGAAAAGAAAAAAAACAGATTATACTATAGATCAGGATTCCAACAATTTGCTGGATTAGCCGGTGGTGCTGAGGTGTTGTTTCATAGGCCGTATATGGAACATGATGAGAGATCAGAAGAATCTCTGTTGGAAAGGCCAGGCGCTGTGTGGTTCAAATTAACCTTAGATGTTGCTACCCAAGCACCACCTAACTGGTTAGATGGTAAAGGCAGAGTTAGGACACCTCCAGAAGTTCATCATTTCAAAACCGCTCTGTCAAATAAATCTAAACATACGAGAACCTTGCAACCAGGATTGAGAGTCCTTTCTGTTGATTTGGGTATGAGAACATTTGCTTCTTGTTCTGTTTTCGAATTGATCGAAGGTAAACCTGAAACAGGTAGAGCATTCCCTGTTGCTGACGAAAGATCAATGGATAGTCCAAATAAGTTATGGGCCAAGCACGAGAGAAGCTTTAAACTAACTCTGCCTGGAGAAACACCGAGCAGAAAGGAGGAAGAAGAGAGAAGCATTGCTAGGGCAGAGATTTACGCGCTGAAAAGAGATATTCAAAGACTGAAATCACTCCTAAGATTAGGTGAGGAAGATAATGATAATAGAAGAGATGCTTTG
TTAGAGCAATTCTTTAAAGGATGGGGTGAAGAGGACGTAGTTCCTGGTCAAGCTTTCCCTAGAAGCCTCTTTCAGGGATTAGGCGCTGCACCCTTTAGGTCAACACCCGAATTGTGGAGACAGCACTGTCAGACGTATTACGACAAAGCGGAAGCTTGCCTGGCAAAGCATATTTCCGACTGGAGGAAGAGAACTAGACCTCGTCCGACTTCGAGAGAGATGTGGTATAAGACAAGATCTTACCATGGTGGCAAAAGTATTTGGATGCTAGAATACTTAGATGCTGTCCGCAAATTACTACTTTCATGGTCGTTAAGAGGTCGTACTTACGGAGCTATTAATAGACAAGACACCGCTCGTTTTGGTTCCTTAGCTTCTAGATTGTTGCATCATATCAACTCTTTAAAGGAAGACCGCATCAAAACCGGTGCAGATAGTATTGTGCAGGCCGCAAGGGGCTATATTCCTCTCCCACATGGCAAGGGTTGGGAACAGCGTTATGAACCCTGTCAGTTGATATTATTTGAAGATCTAGCTAGGTACAGATTTCGTGTAGACAGACCTCGGAGAGAGAATTCGCAATTGATGCAGTGGAATCATCGAGCTATAGTAGCAGAAACGACGATGCAAGCTGAACTATACGGTCAAATAGTCGAAAATACCGCTGCTGGTTTCTCCTCAAGATTTCATGCTGCAACTGGTGCTCCTGGTGTCAGATGTCGCTTTTTGTTAGAACGAGATTTCGATAATGACCTACCAAAGCCGTACTTACTGAGAGAACTAAGTTGGATGTTAGGTAACACAAAGGTTGAATCAGAGGAAGAAAAATTGCGTCTTCTAAGCGAGAAAATTAGACCAGGTTCATTAGTCCCTTGGGATGGGGGTGAACAATTCGCGACATTACACCCGAAAAGACAAACTCTTTGTGTCATTCACGCAGATATGAACGCTGCTCAAAACCTGCAACGCAGATTTTTCGGAAGGTGTGGGGAAGCCTTTCGCCTTGTGTGTCAGCCACATGGTGATGATGTTTTGAGGCTAGCGTCTACACCAGGTGCAAGACTTTTGGGTGCATTACAACAACTGGAAAATGGTCAGGGAGCTTTCGAATTAGTTCGTGATATGGGTAGCACATCACAAATGAATCGTTTCGTCATGAAGTCGTTGGGCAAAAAAAAGATCAAGCCATTACAAGACAATAACGGGGATGATGAACTAGAAGACGTGCTATCTGTTTTACCTGAAGAAGATGATACCGGACGAATTACTGTATTTCGGGACTCTTCGGGTATATTCTTCCCTTGTAACGTTTGGATCCCGGCAAAACAGTTCTGGCCTGCGGTCCGTGCTATGATTTGGAAGGTTATGGCATCACATTCATTGGGTTAG SEQATGACAAAGTTAAGGCATAGACAGAAGAAGTTAACTCACGATTGGGCGGGGTCTAAAAAGAGAGAAGT IDTCTAGGGAGCAATGGTAAATTACAGAATCCATTGCTAATGCCCGTCAAAAAAGGTCAGGTGACAGAATTNO: 
TCGAAAAGCATTTTCCGCATACGCCCGAGCAACCAAAGGGGAAATGACGGATGGCAGAAAAAATATGT142TTACTCACTCATTTGAACCATTCAAGACCAAGCCTTCGTTACATCAGTGCGAACTGGCTGACAAAGCCTACCAGAGCTTGCATTCATATTTACCGGGTTCTTTGGCGCATTTTCTTTTATCTGCCCATGCACTTGGTTTTAGGATTTTTAGCAAATCAGGGGAAGCCACTGCATTCCAAGCGTCCTCAAAGATTGAAGCTTACGAAAGCAAGTTAGCTAGCGAGCTTGCTTGTGTTGATTTGTCTATTCAGAACTTGACTATTTCAACTTTGTTCAACGCATTAACGACTTCCGTAAGAGGTAAAGGTGAGGAGACATCGGCAGATCCACTGATAGCTAGATTTTACACCTTACTTACCGGTAAACCACTAAGCAGAGACACTCAGGGCCCAGAACGAGATTTAGCCGAGGTGATAAGCAGAAAAATTGCAAGTTCTTTTGGAACTTGGAAGGAGATGACTGCCAATCCACTTCAATCTCTTCAATTTTTTGAAGAGGAGTTGCATGCGCTAGATGCAAATGTTAGTTTGTCACCTGCCTTCGATGTTCTGATTAAGATGAACGACCTGCAGGGTGACTTGAAGAACAGAACGATAGTTTTTGATCCAGATGCTCCTGTGTTTGAATATAATGCTGAGGATCCTGCTGACATCATCATTAAACTGACAGCTAGATATGCGAAAGAAGCAGTGATTAAAAATCAAAATGTCGGGAATTATGTTAAGAACGCTATTACGACAACTAACGCAAACGGACTAGGTTGGTTGCTGAACAAAGGCCTTTCCTTATTGCCTGTCTCCACTGATGACGAACTATTGGAGTTTATTGGGGTCGAGAGATCCCATCCTAGCTGTCATGCGTTGATAGAACTTATCGCTCAGTTAGAAGCACCTGAACTGTTCGAAAAAAATGTTTTTTCTGATACTCGTTCCGAGGTTCAAGGTATGATAGATTCAGCTGTAAGCAATCATATCGCCAGGCTGTCAAGCTCTCGTAATTCATTGAGCATGGACTCAGAGGAACTTGAGAGATTGATAAAATCTTTTCAAATTCATACACCACATTGTTCATTATTTATAGGGGCTCAATCCTTATCTCAACAATTGGAAAGCCTACCCGAAGCATTGCAGTCAGGAGTGAACAGTGCTGATATTCTGCTCGGCTCAACCCAATACATGTTGACAAATTCTTTGGTCGAGGAGTCAATCGCTACGTATCAGAGAACCTTAAATAGAATTAACTACCTGTCCGGCGTTGCAGGACAGATTAACGGTGCTATTAAGAGGAAAGCTATTGATGGTGAGAAGATACATTTACCCGCTGCTTGGTCAGAGTTAATTTCTTTACCCTTTATTGGGCAACCAGTGATTGATGTTGAATCAGATTTAGCCCACTTAAAGAACCAATACCAGACATTGTCTAACGAATTTGATACGCTGATTTCCGCACTGCAAAAGAATTTCGACTTAAATTTTAATAAAGCCTTGCTTAATCGAACACAACATTTCGAGGCTATGTGTAGATCAACAAAAAAGAATGCCCTTTCTAAGCCTGAGATCGTTAGTTATAGAGATTTGCTAGCCAGGTTGACTTCTTGTCTTTATAGGGGCTCTCTAGTCTTGAGGAGGGCGGGTATAGAAGTACTGAAAAAGCACAAGATATTTGAGTCCAACTCTGAATTAAGAGAGCACGTTCATGAAAGAAAACACTTCGTATTTGTTTCTCCGCTCGATAGAAAAGCCAAGAAGCTCCTACGTTTGACTGACTCTAGGCCTGATTTATTGCACGTAATTGATGAAATACTACAACATGATAATTTAGAGAACAAGGATAGAGAATCTTTGTGGTTAGTTCGATCTGGTTATTTACTGGCCGGCCTACCAGACCAACTCTCCTCTTCCTTTATAAATCTTCCAATCATTACTCAAA
AAGGCGATCGTCGCTTGATAGATCTCATTCAATACGACCAAATTAATAGAGATGCTTTTGTGATGTTGGTAACTTCCGCTTTTAAGTCGAACTTAAGTGGGCTGCAGTACAGAGCAAACAAACAATCTTTTGTGGTTACGCGCACTTTGTCACCATATTTGGGATCTAAATTGGTTTATGTGCCCAAAGATAAAGATTGGCTGGTCCCTTCCCAAATGTTCGAGGGGAGATTTGCGGACATTTTGCAATCCGATTATATGGTGTGGAAGGACGCTGGAAGATTGTGTGTTATTGACACAGCTAAGCATTTGTCTAACATTAAAAAATCTGTATTCTCAAGTGAAGAAGTCCTCGCGTTTTTAAGAGAATTGCCACACCGTACGTTTATCCAAACTGAGGTCAGGGGTTTAGGGGTGAATGTGGACGGTATTGCATTTAATAACGGGGATATACCCTCTCTGAAGACGTTTAGCAATTGCGTGCAAGTCAAAGTGAGTCGGACAAACACTAGTCTGGTCCAAACATTAAATAGATGGTTTGAAGGCGGTAAGGTCTCGCCGCCTAGCATCCAATTTGAGAGAGCATATTACAAAAAAGATGATCAAATCCACGAGGACGCTGCAAAAAGGAAGATAAGGTTTCAAATGCCAGCTACAGAGTTGGTACACGCGTCAGACGACGCAGGATGGACCCCCTCCTATTTACTTGGTATCGATCCCGGTGAATATGGTATGGGTTTGTCATTGGTCTCAATAAATAATGGCGAAGTTTTAGATAGCGGATTTATACACATAAATTCATTGATAAATTTCGCTTCTAAGAAATCAAATCATCAAACCAAAGTTGTTCCGAGGCAGCAATACAAGTCACCATACGCCAACTATCTAGAACAATCTAAAGATTCTGCAGCAGGAGACATAGCTCATATTTTGGATAGACTTATCTACAAGTTGAACGCCCTACCCGTTTTCGAAGCTCTATCTGGCAATAGTCAAAGCGCAGCGGATCAGGTTTGGACAAAAGTCCTCAGCTTCTACACCTGGGGAGATAATGATGCACAAAATTCAATTCGTAAGCAACATTGGTTCGGTGCTTCACACTGGGACATTAAAGGCATGTTGAGGCAACCGCCAACAGAAAAAAAGCCCAAACCATACATTGCCTTTCCCGGTTCACAAGTTTCTTCTTATGGTAATTCTCAAAGGTGTTCATGTTGTGGACGTAACCCAATTGAACAATTGCGCGAAATGGCGAAGGACACATCCATTAAGGAGTTGAAGATTAGAAATTCAGAAATTCAATTGTTCGACGGTACTATAAAGTTATTTAATCCAGACCCGTCAACGGTCATAGAAAGAAGAAGACATAATTTAGGGCCATCAAGAATTCCTGTAGCTGATAGAACTTTCAAAAATATAAGTCCAAGCTCACTAGAATTCAAAGAACTAATAACGATTGTGTCACGGTCTATACGTCATTCCCCAGAATTTATTGCTAAAAAAAGAGGTATAGGTAGTGAGTACTTTTGTGCTTATAGTGATTGTAATTCCTCCTTAAATTCAGAAGCAAATGCGGCTGCGAACGTTGCCCAAAAGTTCCAAAAGCAATTGTTTTTCGAATTATAG 
SEQ ID NO: 143
ATGAAAAGAATCTTGAACTCTTTAAAGGTTGCCGCCCTGCGTTTGTTATTTAGAGGTAAAGGATCTGAACTTGTCAAGACTGTTAAATACCCTTTGGTCTCGCCGGTTCAGGGTGCAGTTGAGGAGTTAGCTGAGGCGATCCGCCATGATAACCTACATCTGTTTGGTCAAAAAGAAATTGTTGACCTTATGGAAAAGGATGAAGGTACGCAAGTTTACTCAGTGGTTGATTTCTGGTTAGATACCCTTCGTTTGGGGATGTTTTTCAGTCCATCAGCAAACGCATTAAAAATCACGCTGGGTAAGTTTAATTCTGATCAGGTTAGCCCTTTTAGGAAAGTGTTAGAGCAGTCTCCATTCTTCTTGGCTGGTAGGCTGAAGGTTGAACCGGCAGAACGTATATTATCTGTCGAGATCCGTAAGATTGGGAAGAGGGAAAACAGAGTTGAGAACTATGCTGCTGACGTAGAAACGTGTTTTATAGGCCAATTAAGTTCAGATGAGAAACAGTCAATACAAAAATTAGCTAATGATATCTGGGATAGTAAAGATCATGAAGAGCAAAGAATGTTAAAGGCAGATTTCTTCGCTATCCCTTTGATTAAGGATCCAAAGGCTGTGACCGAAGAGGATCCTGAAAATGAAACTGCTGGTAAACAAAAACCCTTGGAGTTGTGTGTCTGCCTTGTCCCAGAACTTTACACAAGAGGATTCGGGTCAATAGCCGATTTTTTGGTTCAACGCTTAACTCTTTTAAGGGATAAAATGTCTACAGATACTGCAGAAGATTGTTTAGAATATGTCGGGATTGAGGAGGAAAAAGGTAACGGCATGAACTCATTGTTGGGAACGTTCTTAAAGAATTTGCAAGGCGATGGATTTGAGCAGATTTTCCAATTTATGTTAGGGAGCTATGTCGGTTGGCAAGGGAAGGAAGATGTTTTAAGAGAGAGATTAGACTTATTGGCTGAAAAAGTGAAGAGGTTACCGAAACCAAAATTTGCTGGCGAATGGTCTGGTCATAGGATGTTCTTGCATGGCCAATTGAAGTCTTGGTCTTCAAATTTTTTTAGACTATTTAACGAGACAAGGGAACTTCTAGAGTCTATTAAGTCAGATATACAGCATGCCACAATGCTAATATCATATGTAGAAGAAAAAGGTGGTTATCATCCTCAATTACTTAGTCAATATAGAAAACTTATGGAACAACTACCAGCTTTGCGTACCAAGGTATTGGACCCTGAGATTGAAATGACACATATGTCCGAAGCAGTTCGCTCTTATATAATGATACATAAATCTGTTGCGGGTTTTTTACCGGATTTATTAGAATCATTAGATAGAGACAAGGATCGTGAGTTTCTGCTTAGTATTTTTCCAAGAATCCCAAAAATTGATAAAAAAACCAAGGAAATTGTAGCTTGGGAACTGCCGGGAGAACCAGAAGAAGGTTATTTATTTACTGCTAATAACTTGTTCAGAAACTTCTTAGAGAATCCGAAACATGTCCCGAGATTTATGGCCGAAAGGATCCCAGAAGATTGGACTCGATTACGCTCTGCTCCTGTCTGGTTCGATGGAATGGTAAAACAATGGCAAAAAGTCGTTAACCAGTTAGTAGAATCACCAGGTGCTTTATATCAATTTAACGAATCCTTCTTGAGACAAAGGTTACAGGCCATGTTAACTGTGTATAAGAGGGACTTACAAACTGAAAAATTTCTTAAACTTTTGGCGGATGTTTGTAGGCCTCTTGTAGATTTTTTTGGTTTGGGTGGAAATGATATTATTTTTAAGAGCTGTCAAGACCCAAGAAAACAATGGCAAACCGTTATTCCTCTCTCTGTTCCGGCAGATGTCTATACTGCTTGCGAAGGTTTGGCGATTAGACTAAGGGAGACATTAGGATTCGAATGGAAGAATTTGAAAGGTCACGAGAGAGAAGATTTCTTAAGATTGCACCAGTTATTGGGCAATTTACTTTTC
TGGATTCGTGATGCTAAATTGGTAGTAAAATTAGAGGATTGGATGAACAACCCATGTGTTCAGGAATATGTAGAAGCCCGGAAAGCTATCGATCTTCCACTAGAAATATTCGGTTTTGAAGTGCCTATCTTCCTGAATGGCTATCTATTTTCGGAGTTGAGACAATTAGAACTTTTGCTTAGGAGAAAAAGTGTGATGACTAGCTACAGTGTAAAGACTACTGGATCTCCTAATAGGCTATTTCAGCTAGTTTATTTACCTCTAAACCCTAGTGACCCCGAAAAGAAGAACTCAAATAACTTTCAAGAACGTTTGGATACCCCAACTGGTTTGTCCCGTCGTTTCCTAGACCTAACCCTTGATGCATTCGCAGGTAAGTTACTTACCGATCCAGTTACACAAGAATTGAAGACAATGGCAGGTTTTTACGATCATCTTTTTGGATTCAAATTGCCATGTAAACTCGCCGCCATGTCGAATCATCCAGGTTCTTCTTCAAAGATGGTTGTGTTAGCGAAACCCAAAAAAGGTGTTGCTTCTAATATAGGGTTTGAACCGATCCCAGATCCCGCTCATCCCGTATTTAGGGTTAGATCCAGTTGGCCAGAGTTGAAGTACCTCGAGGGGCTATTGTATTTGCCAGAAGACACACCTTTGACCATCGAATTAGCAGAGACCTCCGTATCGTGCCAAAGTGTCTCGTCAGTTGCATTCGATTTGAAAAACTTGACAACGATCTTAGGTCGTGTGGGAGAATTTAGGGTCACAGCTGATCAACCCTTTAAACTAACGCCTATAATCCCGGAGAAAGAAGAATCTTTTATTGGTAAAACTTATTTGGGTCTCGACGCGGGTGAAAGGAGCGGCGTCGGTTTCGCTATTGTTACAGTGGACGGAGATGGGTACGAAGTGCAAAGATTGGGGGTCCACGAGGATACACAGCTTATGGCCTTGCAGCAAGTTGCTAGTAAATCCTTAAAAGAGCCAGTATTTCAGCCTCTAAGAAAAGGCACCTTTAGACAACAAGAAAGAATACGGAAATCCTTACGTGGTTGCTACTGGAATTTTTATCATGCCTTGATGATAAAATATAGGGCCAAAGTAGTACATGAGGAATCTGTCGGAAGTAGTGGTCTTGTGGGTCAATGGTTGAGGGCTTTTCAGAAGGATTTGAAGAAAGCCGATGTTCTCCCCAAGAAGGGCGGTAAAAACGGTGTAGATAAGAAGAAGAGAGAGTCCTCAGCTCAAGACACTCTTTGGGGTGGTGCTTTCTCTAAAAAGGAGGAGCAACAGATTGCGTTTGAGGTGCAAGCTGCAGGTTCTTCGCAATTTTGTTTGAAGTGCGGATGGTGGTTCCAACTAGGCATGCGTGAAGTAAACAGGGTACAAGAATCGGGCGTCGTGTTAGATTGGAATAGAAGCATAGTTACCTTTTTAATAGAATCATCCGGCGAAAAAGTTTATGGTTTCTCCCCACAGCAATTAGAGAAGGGTTTCAGACCAGACATCGAAACTTTTAAAAAGATGGTAAGAGACTTTATGAGACCTCCTATGTTTGATAGAAAAGGCAGACCGGCCGCAGCTTACGAGAGATTTGTTTTAGGAAGGAGACATCGAAGGTACAGGTTTGATAAAGTATTTGAGGAAAGATTTGGGAGGTCTGCTCTTTTCATTTGTCCTAGAGTAGGTTGTGGAAATTTTGACCACAGCTCCGAACAGTCCGCGGTTGTTTTGGCCTTGATCGGATATATTGCCGATAAGGAGGGAATGTCAGGTAAGAAGTTGGTTTATGTACGGCTGGCCGAACTTATGGCCGAATGGAAACTAAAAAAATTAGAAAGATCCAGAGTTGAAGAACAATCATCCGCTCAATAA SEQATGGCAGAAAGCAAACAAATGCAGTGTAGGAAATGTGGAGCTAGTATGAAGTACGAAGTCATCGGTTT 
IDGGGTAAAAAGTCATGTAGATACATGTGTCCCGATTGTGGCAACCATACCTCGGCAAGAAAGATACAAAANO: CAAAAAAAAAAGAGATAAAAAATATGGGTCAGCCAGTAAAGCCCAATCTCAAAGAATTGCTGTAGCAG144GTGCTCTTTACCCTGACAAAAAAGTACAAACTATCAAAACCTATAAATATCCAGCAGACTTGAATGGTGAGGTGCATGATAGCGGTGTTGCCGAGAAAATCGCACAAGCAATACAAGAGGACGAGATTGGACTTTTGGGACCAAGCTCAGAATATGCATGCTGGATTGCATCTCAAAAACAGTCTGAGCCTTACAGTGTAGTCGATTTCTGGTTTGATGCAGTGTGCGCAGGGGGAGTCTTCGCCTACTCTGGCGCTAGATTATTGAGTACAGTTTTACAGTTATCCGGTGAGGAATCGGTGCTTAGAGCTGCCTTAGCCTCGTCTCCATTCGTTGACGATATAAACTTAGCGCAAGCCGAAAAGTTTTTGGCGGTTAGCAGGCGTACAGGTCAAGATAAGTTAGGTAAGAGAATTGGGGAGTGCTTTGCAGAAGGAAGATTGGAAGCTTTAGGGATAAAAGATAGAATGAGGGAATTTGTTCAAGCTATCGATGTTGCACAGACCGCCGGACAACGTTTCGCTGCCAAATTGAAGATATTCGGTATAAGTCAGATGCCAGAAGCTAAGCAATGGAATAACGATTCCGGACTGACTGTCTGTATACTACCTGATTATTATGTTCCCGAAGAGAATCGCGCGGACCAACTTGTAGTGTTGTTAAGAAGACTTCGCGAGATTGCATATTGCATGGGTATTGAAGATGAAGCGGGTTTCGAACATCTTGGAATAGATCCTGGTGCTCTTTCGAATTTTTCAAACGGTAACCCTAAGAGAGGATTTCTAGGGAGGCTGTTAAATAACGATATTATTGCGTTGGCAAACAATATGAGTGCGATGACTCCATATTGGGAAGGGCGTAAGGGTGAACTCATAGAAAGGCTTGCGTGGTTAAAGCACAGGGCAGAAGGGCTGTATCTTAAAGAACCTCATTTCGGTAACTCCTGGGCCGATCATAGGTCACGAATTTTCTCAAGGATCGCAGGCTGGTTATCTGGTTGCGCTGGCAAGTTGAAAATTGCGAAAGACCAAATTTCTGGAGTACGTACAGATCTATTTCTGCTAAAAAGACTGCTGGACGCAGTTCCGCAATCGGCGCCATCCCCCGATTTTATTGCGTCAATTTCGGCACTTGACAGGTTTTTAGAAGCTGCAGAATCGAGCCAGGACCCTGCTGAACAAGTGAGGGCTCTCTACGCTTTTCACTTGAACGCACCTGCAGTCCGAAGTATAGCCAATAAAGCAGTGCAAAGGTCCGACAGCCAAGAATGGCTGATAAAAGAACTAGACGCTGTTGACCATTTAGAATTTAACAAAGCGTTCCCATTTTTCTCTGACACAGGAAAAAAAAAAAAAAAAGGTGCTAATAGCAACGGTGCTCCATCGGAAGAAGAGTACACTGAAACGGAATCAATACAACAACCTGAGGACGCGGAACAGGAAGTAAACGGACAAGAAGGGAACGGAGCGTCTAAAAATCAAAAGAAATTTCAAAGAATACCTAGATTCTTCGGTGAAGGCTCCAGATCTGAATACAGAATTTTAACGGAAGCTCCACAGTATTTCGATATGTTTTGTAATAACATGAGGGCTATATTTATGCAGTTAGAAAGTCAACCCCGTAAAGCTCCCAGAGATTTTAAATGTTTCCTACAAAATCGATTACAAAAATTATACAAACAGACTTTCTTGAATGCACGAAGCAACAAGTGTCGCGCTCTGCTTGAGTCAGTTTTAATCTCTTGGGGAGAATTTTATACATACGGTGCCAACGAAAAGAAATTTAGATTAAGACATGAAGCTTCAGAACGCAGCAGTGACCCAGATTACGTAGTTCAGCAAGCCTTGGAAATCG
CGCGTCGTCTATTCCTTTTTGGCTTCGAATGGAGAGATTGCTCCGCTGGTGAAAGAGTGGATTTGGTTGAAATTCACAAAAAGGCTATCAGTTTTTTGTTGGCTATTACTCAAGCTGAGGTCTCTGTTGGTTCATACAATTGGCTTGGCAACTCAACAGTATCGAGATATTTATCCGTTGCGGGAACTGATACCTTATACGGTACCCAATTGGAAGAATTCCTGAACGCTACAGTGTTGAGTCAAATGCGTGGTCTGGCCATTAGATTGAGTTCTCAAGAACTTAAGGACGGTTTTGATGTGCAGCTCGAGTCTTCCTGCCAGGACAATCTGCAACACCTATTGGTGTATAGGGCTTCGAGAGATTTGGCGGCTTGCAAGCGCGCTACTTGTCCAGCCGAACTCGATCCTAAGATTTTAGTTTTACCGGTAGGTGCATTCATCGCTTCCGTAATGAAAATGATAGAAAGAGGTGACGAACCTTTAGCTGGTGCTTATTTACGGCATAGGCCACACTCTTTCGGATGGCAAATTAGGGTCCGCGGTGTTGCTGAGGTAGGGATGGATCAGGGTACAGCATTGGCCTTTCAAAAGCCAACAGAGTCAGAACCTTTTAAAATTAAGCCCTTCTCTGCACAGTATGGACCAGTTCTGTGGTTGAACAGTAGTAGTTATTCTCAATCACAATATTTGGACGGTTTTCTATCTCAACCAAAAAATTGGAGTATGAGGGTGTTGCCTCAGGCGGGTTCAGTTCGCGTCGAACAACGAGTTGCTTTGATATGGAACTTACAAGCAGGCAAGATGAGACTAGAACGCTCCGGTGCGAGGGCCTTTTTCATGCCTGTACCGTTTTCATTTAGGCCATCCGGCAGTGGGGACGAAGCAGTTTTGGCGCCCAACCGGTACTTGGGTCTGTTCCCTCATTCCGGAGGTATAGAATACGCTGTAGTGGATGTCCTGGATTCTGCTGGATTTAAAATTCTTGAAAGAGGCACTATTGCTGTCAATGGTTTCTCTCAGAAAAGGGGAGAGCGCCAAGAAGAAGCCCATCGTGAAAAACAAAGAAGGGGGATAAGTGATATAGGGCGAAAGAAGCCTGTGCAGGCAGAAGTCGATGCGGCGAACGAATTGCATAGAAAGTACACTGATGTTGCCACAAGATTAGGTTGTAGAATCGTCGTTCAATGGGCACCACAACCTAAACCAGGGACAGCACCGACAGCGCAAACTGTTTACGCGAGGGCTGTTAGGACAGAAGCTCCGAGGAGCGGCAACCAAGAAGATCATGCAAGAATGAAAAGTTCTTGGGGTTACACCTGGGGTACGTATTGGGAGAAACGAAAACCAGAAGATATTTTAGGGATTTCTACACAGGTGTATTGGACAGGAGGTATAGGCGAATCCTGTCCTGCTGTAGCAGTCGCTTTATTAGGTCATATTAGAGCAACTTCAACACAAACGGAGTGGGAAAAGGAAGAAGTTGTCTTTGGAAGACTGAAGAAGTTCTTTCCGAGTTAA SEQATGGAGAAGAGAATTAATAAGATACGGAAAAAATTATCTGCGGATAATGCAACAAAGCCAGTCTCTCGT 
IDTCAGGCCCCATGAAAACCCTGCTTGTAAGAGTAATGACGGATGATTTAAAAAAGAGGTTGGAAAAGCGTNO:AGAAAAAAACCAGAAGTGATGCCGCAAGTGATCTCAAATAACGCAGCTAATAATCTAAGGATGCTACTT145GATGATTATACAAAAATGAAAGAAGCAATCCTGCAAGTTTACTGGCAGGAATTCAAGGATGACCATGTTGGACTAATGTGCAAATTCGCACAACCAGCGTCTAAGAAAATTGACCAAAATAAATTGAAACCCGAAATGGACGAAAAAGGGAATTTAACAACTGCCGGGTTTGCCTGCTCGCAATGTGGGCAACCATTATTTGTTTATAAATTAGAGCAGGTTTCGGAAAAAGGAAAGGCTTACACAAATTACTTCGGCAGATGTAATGTTGCCGAACACGAAAAACTCATATTGTTAGCTCAGTTGAAGCCTGAGAAAGACTCTGATGAGGCCGTTACTTACTCGTTGGGGAAGTTTGGTCAAAGAGCTCTCGATTTTTATTCTATTCATGTGACAAAGGAGTCCACACATCCCGTCAAGCCCTTGGCACAAATTGCGGGTAATAGATACGCTTCGGGTCCAGTTGGGAAGGCCCTTTCTGATGCATGTATGGGCACAATTGCTAGCTTTCTTAGTAAATACCAGGATATCATAATAGAGCATCAAAAAGTTGTAAAGGGTAACCAAAAGAGATTAGAATCGCTGCGTGAGTTGGCGGGTAAAGAAAACTTGGAATATCCATCTGTCACTCTGCCTCCTCAACCTCATACTAAGGAAGGTGTAGATGCGTACAATGAAGTTATCGCTAGAGTCCGTATGTGGGTGAATTTAAATTTGTGGCAAAAATTGAAGTTATCGCGTGATGATGCAAAACCTCTTCTTAGACTAAAGGGCTTTCCTAGCTTCCCTGTAGTGGAAAGACGCGAAAATGAAGTCGATTGGTGGAATACAATTAACGAAGTCAAAAAACTGATCGATGCAAAGCGAGATATGGGTCGAGTTTTTTGGTCTGGTGTTACAGCTGAAAAAAGGAATACGATCTTAGAAGGTTACAACTACTTGCCAAATGAGAACGATCATAAAAAAAGAGAAGGCAGTTTAGAAAATCCAAAAAAGCCAGCTAAGAGACAATTTGGTGATTTGCTACTTTACCTAGAAAAAAAGTACGCCGGAGATTGGGGGAAAGTCTTTGACGAAGCTTGGGAGAGAATAGATAAAAAAATAGCAGGATTGACGTCACACATTGAAAGAGAAGAGGCGAGAAATGCAGAAGATGCTCAGTCCAAAGCTGTCCTCACCGACTGGTTGAGAGCCAAAGCGTCCTTTGTTCTCGAACGCCTAAAAGAAATGGATGAGAAGGAATTTTATGCCTGCGAAATCCAGCTACAAAAATGGTACGGAGACTTGAGAGGTAACCCCTTTGCCGTGGAAGCAGAGAACCGTGTTGTAGATATCTCCGGTTTCTCAATCGGTAGCGATGGACACTCCATTCAGTATCGCAACTTGTTGGCCTGGAAATATTTGGAAAACGGTAAGAGGGAATTCTATTTACTTATGAATTATGGCAAGAAAGGTAGAATCAGGTTTACTGACGGAACAGACATTAAAAAGAGTGGTAAGTGGCAAGGCCTTTTGTACGGTGGTGGCAAGGCCAAAGTAATAGACTTAACATTTGACCCCGACGACGAACAACTGATAATACTGCCTTTAGCTTTTGGTACTCGACAGGGGCGAGAGTTCATTTGGAATGATCTTTTGTCACTCGAGACTGGTTTGATAAAACTTGCAAATGGAAGAGTCATCGAGAAGACAATTTACAACAAAAAGATAGGTCGCGATGAGCCTGCACTATTTGTGGCCTTGACCTTTGAGAGAAGGGAAGTTGTCGACCCATCCAATATTAAACCAGTCAACCTAATCGGTGTAGATAGAGGTGAAAACATCCCAGCTGTTATCGCTCTGACAGACCCTGAA
GGTTGCCCTTTGCCAGAATTTAAAGATTCGTCTGGTGGACCAACAGATATATTACGTATTGGGGAAGGCTATAAAGAGAAACAACGTGCTATTCAGGCTGCAAAAGAAGTTGAACAGAGGAGAGCTGGAGGTTACAGTAGAAAATTCGCCAGTAAAAGTAGAAACTTAGCAGATGACATGGTTAGAAACTCTGCCCGGGATTTGTTCTATCATGCGGTTACTCACGATGCAGTCTTAGTCTTTGAAAATCTATCGCGCGGTTTTGGTAGGCAAGGCAAGAGGACTTTTATGACAGAGAGACAATATACAAAAATGGAAGATTGGTTAACCGCGAAGCTCGCATATGAAGGTCTTACTTCGAAAACGTACCTCAGCAAAACGCTGGCTCAATATACTTCTAAAACTTGTTCAAATTGTGGTTTTACTATTACCACGGCAGACTACGACGGGATGTTGGTGAGATTGAAGAAGACGAGCGATGGTTGGGCAACAACATTGAATAATAAGGAATTAAAAGCAGAAGGACAGATTACGTATTACAATCGTTATAAACGCCAAACGGTTGAGAAAGAGTTGTCAGCCGAGTTGGATAGACTAAGTGAAGAGAGCGGTAACAATGATATCTCAAAGTGGACTAAAGGGAGGCGGGATGAAGCCCTCTTTTTACTAAAGAAGAGATTCTCACATAGACCTGTGCAAGAACAATTCGTTTGTTTAGATTGTGGCCATGAGGTTCATGCAGACGAACAGGCTGCGTTAAATATTGCGAGAAGCTGGCTATTTCTAAATTCTAATTCAACAGAGTTCAAGAGCTATAAATCCGGAAAACAACCTTTCGTAGGCGCGTGGCAAGCCTTCTATAAAAGGAGATTAAAAGAGGTTTGGAAACCAAATGCA SEQATGAAAAGAATTAACAAAATTAGAAGGAGGCTGGTCAAAGATTCTAATACCAAGAAAGCTGGTAAGAC IDTGGTCCGATGAAAACCCTATTAGTCAGAGTTATGACCCCAGATTTGAGAGAAAGATTGGAGAACCTCAGNO:GAAAAAGCCCGAAAACATCCCACAACCCATTAGTAACACATCAAGAGCTAATTTAAACAAGTTATTAAC146TGACTACACTGAAATGAAAAAAGCAATATTGCATGTTTACTGGGAAGAGTTCCAGAAAGATCCTGTTGGGTTGATGTCTAGAGTTGCTCAACCGGCCCCAAAGAATATAGATCAAAGGAAACTTATTCCTGTGAAGGACGGCAATGAAAGATTAACCAGCTCCGGTTTCGCTTGCTCCCAGTGCTGCCAACCCCTGTATGTATACAAACTGGAACAAGTAAATGATAAAGGTAAGCCACATACTAACTACTTTGGTAGGTGTAATGTATCCGAGCATGAAAGATTGATCTTGTTAAGTCCCCATAAACCAGAAGCTAATGATGAGTTAGTAACTTATAGTTTAGGTAAGTTCGGACAACGAGCTTTAGATTTCTATAGCATCCATGTTACAAGAGAAAGCAATCACCCCGTCAAACCACTGGAACAAATCGGTGGTAATAGTTGTGCGTCAGGTCCAGTAGGCAAAGCTTTATCAGACGCTTGCATGGGTGCCGTGGCTAGTTTTTTGACGAAATACCAAGATATTATACTGGAACATCAAAAGGTAATTAAAAAGAATGAAAAGAGACTCGCTAACTTAAAAGATATTGCAAGTGCCAATGGTTTAGCTTTTCCTAAAATTACCTTGCCACCTCAGCCACATACAAAGGAGGGAATTGAAGCTTACAATAATGTAGTAGCCCAAATAGTTATTTGGGTGAACCTTAACCTATGGCAAAAGTTAAAAATTGGTAGAGACGAAGCCAAACCCCTGCAGAGGCTGAAGGGTTTTCCCTCCTTCCCCTTAGTAGAGAGACAAGCTAATGAAGTGGACTGGTGGGATATGGTGTGCAATGTTAAAAAATTGATTAATGAGAAGAAAGAGGATGGTAAAGTGTTTTGGC
AGAATCTTGCTGGCTACAAGAGACAGGAAGCTTTACTGCCTTATTTATCTTCTGAGGAAGATAGGAAAAAAGGTAAAAAATTTGCTAGATATCAATTCGGAGACCTACTTCTGCATTTAGAAAAAAAACATGGCGAAGATTGGGGTAAAGTTTATGATGAAGCCTGGGAAAGAATTGATAAGAAGGTAGAAGGTCTCTCCAAACATATTAAATTAGAGGAAGAACGTAGGTCCGAAGACGCTCAATCAAAGGCAGCATTAACTGATTGGTTGAGAGCAAAAGCCTCTTTCGTTATTGAAGGATTAAAAGAAGCCGACAAAGATGAATTTTGTAGATGTGAGTTAAAGTTGCAAAAGTGGTATGGAGACCTCCGTGGTAAACCTTTTGCTATTGAGGCTGAAAATTCTATACTCGATATCTCTGGATTTTCAAAACAATATAACTGCGCATTTATATGGCAGAAAGATGGTGTTAAAAAGCTAAATCTATACTTAATTATCAATTACTTTAAAGGTGGTAAATTGCGTTTTAAGAAGATAAAGCCTGAAGCCTTTGAGGCAAACCGTTTTTACACTGTTATCAATAAAAAATCTGGGGAAATCGTACCAATGGAAGTTAATTTCAATTTCGATGATCCTAATCTTATTATTTTACCTCTTGCTTTCGGCAAAAGGCAAGGTAGGGAGTTTATTTGGAATGATTTATTGTCGCTGGAAACGGGGTCTCTCAAACTCGCAAACGGTAGGGTGATAGAAAAAACATTATACAACAGGAGAACTCGGCAGGATGAGCCAGCTCTTTTTGTGGCTCTGACATTCGAGAGAAGGGAAGTTTTAGATTCATCTAACATCAAACCAATGAATTTAATAGGTATTGACCGGGGTGAAAATATACCTGCAGTTATTGCTTTAACTGATCCTGAGGGATGTCCTCTTAGCAGATTCAAGGACTCGTTGGGTAACCCTACTCACATCTTAAGGATTGGAGAAAGTTACAAGGAGAAACAAAGGACAATACAAGCTGCTAAAGAAGTAGAACAAAGGAGGGCGGGTGGATATAGTCGGAAATATGCCAGCAAGGCCAAGAATTTAGCTGACGACATGGTTAGGAATACAGCTAGAGACCTTTTATACTATGCCGTCACCCAGGATGCCATGTTGATATTTGAAAATTTAAGTAGAGGCTTCGGTAGACAAGGTAAGCGCACCTTCATGGCAGAGAGACAATATACTAGAATGGAAGATTGGTTGACTGCCAAATTGGCATACGAAGGTCTACCTAGTAAGACGTACTTATCTAAAACACTAGCGCAGTATACTTCCAAGACATGCAGTAATTGTGGTTTCACAATCACTTCTGCCGATTACGATCGCGTCTTGGAAAAACTAAAAAAAACAGCGACAGGTTGGATGACTACTATTAATGGGAAAGAATTGAAGGTCGAAGGACAAATAACTTACTATAATAGATATAAACGGCAAAACGTTGTAAAAGACCTGTCAGTCGAACTCGATCGACTTAGTGAAGAATCTGTTAATAATGATATTAGTTCGTGGACAAAAGGTAGATCCGGTGAAGCTTTGAGCCTCCTGAAAAAACGTTTTAGCCATAGGCCTGTCCAAGAAAAGTTTGTATGTTTAAACTGTGGTTTTGAGACCCATGCAGACGAGCAGGCCGCTCTTAATATTGCTAGATCATGGTTATTTTTAAGATCTCAGGAATACAAGAAGTACCAGACTAACAAGACAACAGGCAACACAGATAAGCGAGCATTCGTTGAGACTTGGCAATCTTTTTATAGAAAGAAATTGAAGGAAGTCTGGAAACCA SEQATGGGAAAAATGTATTATCTAGGCCTGGACATAGGGACCAATTCAGTAGGCTACGCTGTCACTGACCCC IDTCCTACCATTTGCTGAAGTTCAAGGGGGAACCCATGTGGGGAGCACACGTGTTTGCGGCCGGCAACCAGNO: 
AGCGCAGAGCGGAGAAGCTTCCGCACCTCCAGGAGAAGGCTGGATCGCAGGCAGCAGCGTGTGAAGCT147GGTCCAAGAGATATTTGCCCCAGTGATTTCCCCCATCGATCCGCGCTTCTTTATTAGGCTCCACGAGTCCGCTCTCTGGCGCGACGACGTGGCCGAAACTGATAAACATATTTTCTTTAATGACCCAACATACACTGACAAGGAGTACTATTCAGATTACCCAACAATTCACCATTTGATCGTGGACCTTATGGAAAGTTCGGAGAAGCATGATCCTCGACTTGTCTATTTGGCCGTGGCGTGGCTCGTGGCACATAGGGGCCACTTCTTGAACGAGGTGGACAAGGATAACATCGGGGATGTGTTATCTTTCGACGCTTTCTATCCTGAATTCCTTGCTTTTCTGTCTGACAATGGCGTCAGCCCGTGGGTCTGCGAATCCAAGGCCCTCCAGGCTACGCTATTGTCAAGAAATAGCGTGAACGACAAGTACAAGGCTCTTAAGTCTTTGATTTTTGGAAGCCAGAAGCCCGAGGACAACTTTGATGCAAATATCTCGGAGGACGGGCTGATTCAGCTCCTCGCTGGGAAAAAGGTCAAGGTCAATAAGCTGTTTCCACAGGAGTCAAATGACGCGAGCTTCACCCTTAACGACAAAGAGGATGCCATTGAAGAGATCCTGGGGACACTCACCCCAGACGAGTGCGAGTGGATAGCCCATATTAGGCGCCTCTTTGATTGGGCCATAATGAAACATGCGCTTAAGGACGGGCGCACGATATCCGAAAGCAAGGTCAAATTGTACGAGCAGCACCACCATGATCTGACCCAGCTAAAATATTTTGTAAAAACATATCTGGCCAAGGAGTACGATGATATCTTCCGCAACGTGGATAGTGAGACCACCAAAAACTACGTCGCGTACTCATACCACGTGAAAGAAGTTAAGGGCACGCTGCCTAAGAACAAGGCAACACAAGAGGAGTTCTGCAAGTACGTTCTCGGGAAAGTTAAAAATATAGAGTGCAGCGAGGCCGACAAAGTGGATTTTGACGAGATGATTCAACGCCTGACCGACAATTCGTTTATGCCTAAACAGGTGAGTGGAGAGAATCGCGTGATTCCATATCAGCTCTATTACTATGAACTCAAGACTATTCTGAATAAGGCCGCTAGCTATTTACCCTTCCTTACGCAGTGCGGGAAGGATGCCATTTCTAACCAGGATAAACTCTTGAGTATAATGACATTTCGAATTCCCTATTTCGTGGGTCCGCTTCGTAAGGATAACAGTGAGCACGCTTGGCTGGAGCGGAAGGCTGGCAAAATTTATCCATGGAATTTCAACGACAAGGTGGATCTGGACAAATCCGAAGAAGCCTTTATCCGCAGGATGACCAATACTTGCACATACTATCCTGGGGAGGATGTCCTTCCACTGGACTCTCTGATCTACGAAAAGTTCATGATTTTGAATGAAATTAACAACATAAGGATCGATGGGTATCCTATTTCCGTCGACGTGAAGCAGCAGGTGTTCGGGCTCTTTGAGAAGAAGCGACGGGTGACCGTGAAGGATATTCAGAATCTTCTCTTATCGCTGGGAGCCCTGGATAAACACGGAAAACTGACCGGGATAGATACTACGATTCATTCTAATTACAACACGTATCACCATTTTAAGTCACTGATGGAGAGGGGCGTCCTAACAAGAGATGACGTGGAGAGAATAGTGGAACGAATGACATATTCTGATGACACCAAGAGAGTGCGGCTTTGGCTGAATAACAACTACGGCACTCTGACGGCGGATGATGTAAAGCATATTTCCCGACTCCGTAAGCATGACTTCGGGCGGCTGTCTAAGATGTTTCTAACAGGCCTCAAGGGTGTGCATAAGGAAACTGGGGAGCGCGCTAGCATCCTGGATTTTATGTGGAACACCAATGATAACCTGATGCAGCTCCTGTCAGAATGCTACACATTTTC
GGACGAAATCACCAAGCTGCAGGAGGCTTACTATGCCAAGGCCCAACTAAGCTTGAATGATTTCCTGGATTCTATGTACATCAGCAACGCCGTAAAACGACCAATTTATAGGACACTGGCAGTGGTTAACGACATTAGGAAAGCATGCGGAACAGCTCCCAAGCGAATCTTTATCGAGATGGCCCGCGACGGCGAGAGTAAGAAGAAAAGGTCAGTGACTAGGCGGGAGCAGATCAAGAACCTTTACCGCTCTATCCGAAAAGACTTCCAGCAAGAGGTTGATTTCCTTGAGAAGATCTTAGAGAACAAGTCAGATGGACAGCTCCAATCCGATGCTCTGTATCTGTACTTCGCTCAGCTGGGACGAGATATGTACACTGGCGACCCCATTAAACTAGAACATATCAAGGACCAATCGTTTTATAATATCGACCACATCTACCCTCAGTCCATGGTGAAAGACGATAGTCTGGACAATAAGGTGCTCGTCCAAAGTGAGATTAACGGAGAAAAGTCGAGCAGATATCCTTTGGACGCTGCGATCCGCAACAAGATGAAGCCCCTGTGGGATGCTTACTACAATCATGGACTGATCAGCCTGAAGAAGTATCAGAGACTGACCCGGAGTACCCCTTTCACAGACGATGAGAAGTGGGATTTTATCAATAGACAACTGGTGGAAACCAGGCAGTCCACGAAAGCTCTGGCCATTCTTCTGAAGAGAAAGTTTCCAGACACAGAGATCGTCTATTCAAAGGCCGGCCTCAGTTCCGACTTTAGACATGAGTTCGGACTCGTTAAATCACGAAATATAAACGATCTCCACCATGCAAAGGACGCATTCCTCGCGATTGTGACTGGAAATGTCTATCACGAAAGATTTAATAGGCGGTGGTTCATGGTTAACCAGCCATACTCAGTGAAGACCAAGACCCTTTTCACTCACTCTATTAAAAATGGCAACTTCGTGGCTTGGAATGGTGAGGAGGATCTTGGAAGAATTGTGAAGATGTTAAAACAGAATAAGAATACCATCCACTTTACTAGATTCAGCTTTGACCGAAAAGAGGGGCTATTCGATATTCAACCGTTAAAGGCTTCAACAGGTCTCGTTCCACGAAAGGCCGGACTGGACGTAGTGAAATACGGCGGCTATGATAAGAGCACCGCAGCTTACTACCTCCTTGTGCGATTTACGCTCGAGGATAAGAAGACCCAACACAAGCTGATGATGATTCCCGTGGAGGGACTGTACAAAGCTCGAATTGACCATGATAAAGAGTTTCTCACAGATTACGCACAAACCACCATCTCTGAGATTCTCCAGAAAGACAAACAAAAAGTTATAAACATAATGTTTCCAATGGGTACAAGGCATATTAAACTGAACAGCATGATCTCCATTGATGGCTTTTATTTGTCCATTGGAGGAAAGTCTAGTAAAGGCAAGTCTGTCCTCTGCCATGCCATGGTACCCCTAATCGTCCCACACAAGATTGAATGCTACATCAAGGCTATGGAGAGTTTTGCTCGGAAATTTAAAGAGAATAATAAGCTGCGTATTGTGGAAAAATTCGACAAGATAACCGTTGAAGACAATCTGAATCTGTACGAGCTCTTTCTGCAGAAGCTGCAGCATAACCCCTATAATAAGTTCTTCTCCACACAGTTCGATGTACTGACCAACGGGCGATCAACTTTCACAAAGCTAAGTCCTGAGGAACAGGTGCAAACACTCCTAAACATTCTTTCCATTTTTAAGACCTGCAGATCTTCAGGATGCGACTTGAAGAGCATTAACGGGAGCGCACAGGCAGCTAGGATCATGATCTCAGCTGACCTGACAGGGCTGAGTAAAAAATACTCCGACATTCGGCTTGTAGAGCAAAGCGCCAGTGGGTTGTTCGTTAGTAAGTCGCAGAACCTGCTGGAATACCTGTAA 
SEQATGTCTTCTTTGACGAAGTTTACAAACAAATACTCTAAGCAGCTTACAATTAAGAACGAACTGATTCCCGID TAGGAAAGACTCTGGAAAACATCAAAGAGAATGGGCTGATAGACGGCGACGAACAACTGAATGAGAACNO:TATCAGAAGGCCAAAATTATCGTGGATGACTTCCTGAGGGATTTTATTAACAAGGCCCTGAATAATACC148 CAGATCGGCAATTGGCGGGAACTGGCCGACGCTCTGAACAAAGAAGATGAGGACAATATCGAAAAATTACAAGACAAAATCAGGGGCATTATTGTCAGTAAGTTCGAGACATTCGATCTGTTCTCTTCGTACTCCATTAAGAAGGACGAGAAAATCATCGATGATGACAATGACGTTGAGGAAGAAGAACTGGACTTGGGTAAAAAGACCTCATCCTTCAAGTATATTTTTAAAAAAAATCTGTTTAAATTAGTGCTCCCCAGTTATTTAAAGACAACTAACCAGGACAAGCTTAAGATTATCTCCTCTTTTGACAACTTTAGCACCTATTTTAGAGGCTTCTTTGAAAATCGCAAGAATATTTTCACTAAGAAGCCCATAAGCACCTCTATTGCCTACAGAATCGTACATGATAACTTCCCAAAATTTTTGGATAACATTAGATGTTTTAATGTATGGCAGACCGAATGTCCTCAGTTAATTGTGAAGGCGGATAACTACCTCAAATCCAAGAATGTGATCGCCAAAGATAAGTCTCTTGCTAACTACTTTACGGTCGGAGCCTACGATTACTTCTTATCTCAAAACGGTATTGACTTTTACAATAACATTATCGGGGGATTGCCTGCCTTCGCCGGCCATGAGAAAATTCAGGGCTTAAACGAGTTCATAAATCAGGAATGTCAAAAGGACTCAGAGCTGAAATCAAAGCTTAAGAATCGACACGCATTTAAAATGGCGGTCTTGTTCAAACAGATCCTCAGCGATAGAGAGAAAAGCTTCGTTATTGATGAATTCGAGAGCGACGCACAGGTGATTGATGCCGTGAAGAACTTCTATGCGGAACAGTGTAAAGACAATAATGTTATTTTCAACCTATTAAACTTGATTAAGAATATCGCGTTTTTAAGTGACGATGAACTCGACGGTATCTTTATAGAAGGCAAGTACCTGTCCTCTGTCAGCCAAAAACTCTACTCAGATTGGTCCAAGCTAAGAAATGACATCGAGGACAGTGCTAACAGCAAACAGGGCAATAAAGAGCTGGCAAAGAAAATCAAGACTAATAAAGGGGATGTGGAGAAGGCGATATCTAAATATGAGTTCTCCCTCTCCGAACTGAACTCCATCGTCCACGATAATACCAAGTTTAGTGATCTGTTGTCGTGTACACTGCACAAAGTGGCCAGTGAAAAACTCGTCAAGGTGAACGAAGGCGATTGGCCCAAACACCTGAAAAATAATGAGGAGAAACAGAAGATCAAAGAACCTTTGGATGCGTTGCTCGAAATATATAACACACTGTTGATCTTCAACTGTAAAAGCTTCAACAAGAACGGGAACTTTTATGTAGACTACGATCGATGTATAAATGAACTGAGCAGCGTCGTTTACCTGTACAACAAGACTCGCAATTATTGTACGAAAAAACCATATAACACCGATAAGTTCAAGCTTAATTTCAACAGTCCCCAGCTGGGAGAAGGGTTCAGCAAATCAAAAGAAAACGATTGCCTGACATTACTCTTTAAAAAGGATGATAATTATTATGTTGGGATTATTAGGAAAGGCGCTAAGATCAACTTTGACGACACACAGGCCATAGCTGACAACACTGATAACTGCATCTTTAAAATGAATTACTTTCTGTTGAAGGACGCCAAAAAATTCATTCCAAAATGCTCTATTCAGCTCAAGGAGGTTAAGGCCCATTTCAAGAAGTCTGAAGATGACTACATCCTCTCTGACAAGGAAAAATTCGCTAGTCCTCTGGTTATCA
AAAAAAGTACCTTCTTGCTGGCTACAGCTCACGTGAAAGGCAAGAAAGGGAACATTAAGAAGTTCCAAAAGGAATACAGCAAAGAGAATCCAACCGAGTACAGAAATTCTCTGAACGAATGGATCGCATTCTGTAAAGAATTTCTAAAGACGTACAAGGCCGCTACCATTTTCGATATTACCACCTTGAAAAAAGCCGAGGAGTACGCCGACATCGTCGAATTCTATAAAGACGTGGATAACCTGTGTTACAAATTGGAATTCTGCCCAATTAAGACCTCTTTCATTGAAAACCTCATCGACAATGGGGACCTCTACTTATTTAGAATTAACAATAAGGATTTTTCTTCGAAATCTACCGGAACTAAAAATCTGCACACACTGTATCTGCAAGCAATCTTCGATGAACGTAATCTCAACAACCCTACAATAATGCTGAACGGCGGTGCTGAACTGTTCTACCGTAAAGAGAGTATTGAACAGAAGAATCGAATCACACACAAAGCGGGCAGTATTCTCGTCAATAAGGTGTGCAAAGACGGGACCAGCCTGGACGATAAGATCAGGAATGAAATATATCAGTATGAGAACAAGTTTATCGACACCTTGTCGGATGAGGCAAAGAAGGTGCTACCTAACGTTATCAAGAAGGAAGCTACCCATGACATAACCAAGGATAAGCGGTTCACTTCTGACAAGTTCTTCTTCCACTGTCCTCTGACCATTAACTACAAGGAAGGAGACACTAAACAATTCAATAATGAAGTACTTAGCTTTTTGCGGGGTAATCCCGATATTAACATAATTGGTATCGACCGGGGAGAACGGAACCTGATATACGTGACAGTAATTAATCAGAAAGGAGAAATCCTGGATTCCGTATCCTTCAATACCGTGACTAATAAATCTAGTAAAATCGAGCAGACGGTCGACTACGAGGAAAAGTTAGCAGTCAGAGAGAAGGAGAGAATCGAGGCCAAACGTTCCTGGGATAGTATCAGCAAGATTGCTACTCTGAAAGAAGGATATCTGTCCGCTATCGTCCATGAGATCTGTTTGTTGATGATCAAGCACAATGCTATAGTGGTTCTGGAGAACCTGAACGCAGGCTTCAAGCGAATTAGAGGGGGCCTGTCGGAAAAAAGCGTTTACCAGAAGTTTGAAAAGATGCTAATCAATAAGTTAAATTACTTTGTAAGTAAAAAAGAAAGCGATTGGAATAAGCCATCAGGACTTTTAAACGGGCTGCAACTGAGCGACCAGTTTGAGTCATTCGAAAAACTGGGTATTCAGAGTGGTTTCATATTCTACGTACCTGCCGCTTACACTTCAAAGATCGATCCTACAACTGGTTTTGCGAATGTCCTGAATCTGTCTAAGGTGAGGAATGTGGACGCAATCAAGTCTTTCTTCAGCAACTTCAACGAGATATCTTACAGCAAGAAAGAGGCTCTGTTTAAATTCAGTTTTGATCTGGATAGCCTGAGCAAGAAAGGATTCTCTTCTTTCGTAAAGTTTTCTAAGTCCAAATGGAACGTCTACACGTTCGGAGAGAGAATCATTAAACCAAAGAACAAGCAGGGGTATCGGGAAGACAAAAGGATCAATCTGACTTTCGAAATGAAGAAACTATTGAATGAGTACAAAGTCTCATTCGATTTGGAGAACAATCTGATCCCCAATCTGACCAGCGCTAACCTCAAAGACACATTCTGGAAGGAGCTGTTTTTCATCTTTAAGACCACCCTGCAGCTACGGAATAGTGTCACAAATGGGAAAGAGGATGTACTGATCTCACCTGTGAAAAACGCCAAGGGGGAGTTCTTTGTGTCCGGCACCCATAACAAAACCCTGCCTCAGGACTGTGACGCGAACGGGGCCTACCACATCGCGCTAAAGGGGTTAATGATTCTCGAACGTAATAATCTGGTGCGCGAAGAAAAAGACACAAAGAAAATTATGGCCATCAGCAACGTTGACTGGTTTGAGTACGTGCAGAAG
CGTCGAGGAGTTTTGTAA SEQATGAACAACTATGACGAGTTCACTAAACTTTACCCCATTCAGAAAACCATCAGATTTGAACTGAAGCCT IDCAGGGTCGTACCATGGAACACTTGGAAACTTTCAACTTTTTCGAGGAGGACAGGGATAGAGCTGAGAAANO:TACAAGATCTTGAAAGAGGCCATCGACGAGTATCACAAAAAATTCATCGATGAGCATCTCACCAACATG149TCGCTGGATTGGAACAGTCTCAAGCAGATTTCCGAGAAGTACTATAAATCTCGGGAGGAGAAAGATAAAAAGGTGTTTTTGAGCGAGCAAAAGCGAATGCGACAGGAGATAGTCTCTGAATTTAAGAAAGATGATCGGTTTAAAGACCTATTTTCCAAAAAGCTTTTTTCAGAGCTGCTGAAGGAAGAGATCTATAAAAAAGGCAATCACCAAGAAATTGATGCCCTGAAATCATTCGACAAATTCAGTGGGTATTTCATAGGACTGCATGAGAACCGGAAGAATATGTATAGTGATGGAGACGAGATCACAGCCATAAGCAATCGAATCGTTAACGAGAATTTCCCGAAGTTCCTGGATAACCTGCAGAAGTATCAAGAGGCTAGGAAAAAGTACCCTGAGTGGATCATCAAGGCTGAATCAGCTCTGGTGGCTCACAATATCAAGATGGATGAAGTCTTTAGTCTTGAGTACTTTAATAAAGTCCTTAACCAGGAGGGCATCCAGCGCTATAACCTGGCTCTCGGTGGCTACGTCACAAAAAGCGGAGAAAAGATGATGGGTCTCAACGATGCACTGAATTTGGCTCATCAGTCGGAGAAGTCATCTAAGGGACGCATACACATGACACCACTGTTTAAACAAATCCTGAGCGAAAAGGAATCATTTTCCTACATTCCCGACGTATTCACCGAGGACTCACAACTGCTGCCTAGTATAGGGGGGTTTTTCGCTCAGATAGAGAACGACAAAGATGGCAACATTTTTGACAGAGCCTTGGAGTTGATTTCATCTTACGCCGAGTACGATACGGAGCGCATTTATATTCGCCAGGCGGATATCAACAGGGTTTCCAATGTGATCTTTGGCGAGTGGGGAACGCTGGGCGGGCTGATGCGGGAATACAAAGCCGACTCGATCAATGACATCAACCTGGAGAGAACATGCAAGAAGGTCGATAAATGGTTGGATAGCAAAGAGTTCGCCCTGAGTGACGTCTTGGAAGCTATCAAAAGAACCGGAAATAATGACGCGTTCAACGAGTATATCTCTAAAATGAGGACCGCGAGAGAAAAAATTGATGCAGCAAGGAAGGAGATGAAGTTTATATCTGAGAAGATCTCAGGCGATGAAGAGTCCATCCATATTATTAAAACTCTTCTGGACTCAGTGCAGCAATTCCTGCACTTTTTTAACCTCTTCAAGGCCAGGCAGGATATACCGTTAGACGGGGCTTTTTATGCCGAGTTTGATGAAGTTCATTCGAAACTTTTTGCTATAGTGCCTCTCTATAATAAAGTTCGCAATTACCTGACAAAGAATAACTTAAACACAAAGAAAATCAAGCTCAACTTCAAAAACCCAACACTGGCAAACGGATGGGATCAGAACAAGGTATATGATTACGCCTCATTGATTTTCCTCCGGGACGGGAATTACTATCTGGGGATCATCAACCCTAAGCGCAAAAAGAACATTAAGTTCGAACAGGGATCTGGCAATGGTCCCTTCTATAGGAAAATGGTATACAAACAGATTCCTGGCCCCAACAAGAATCTCCCACGCGTCTTTCTGACGTCCACTAAGGGAAAGAAGGAGTACAAGCCGTCTAAAGAAATTATCGAGGGCTATGAGGCAGACAAGCATATTAGGGGTGACAAGTTTGACCTAGACTTTTGTCATAAGCTTATCGACTTTTTCAAGGAGTCCATAGAGAAGCACAAAGATTGGTCAAAGTTTAATTTCTATTTTTCTCCAACAG
AGTCCTACGGGGATATCTCTGAGTTCTATCTGGATGTTGAAAAGCAGGGGTACAGAATGCACTTCGAAAATATCTCAGCAGAAACTATCGATGAGTACGTAGAGAAAGGAGATCTGTTTCTTTTCCAAATCTACAATAAGGATTTTGTGAAGGCCGCCACTGGGAAGAAGGACATGCACACTATTTACTGGAACGCTGCATTTTCCCCTGAAAATCTGCAGGACGTAGTAGTGAAATTAAATGGTGAGGCAGAACTGTTTTACCGCGATAAATCAGACATCAAGGAAATAGTGCACCGGGAAGGCGAGATTCTTGTTAACCGAACATATAATGGCAGGACACCTGTCCCTGATAAAATTCATAAGAAACTGACCGATTACCACAACGGTCGAACCAAGGATCTGGGCGAGGCCAAGGAATACCTCGATAAGGTGAGGTACTTCAAAGCCCATTATGACATCACCAAGGACCGAAGATACCTTAACGACAAAATCTACTTCCATGTCCCACTCACCTTGAACTTCAAAGCTAACGGTAAGAAGAACCTCAATAAAATGGTGATTGAAAAATTTCTGTCCGATGAGAAGGCCCATATCATCGGCATTGATCGCGGCGAGAGAAATCTCCTTTACTATTCTATCATTGATCGGTCGGGAAAGATTATCGACCAACAATCACTGAATGTCATCGACGGATTCGACTATAGAGAGAAGCTGAACCAACGGGAAATCGAGATGAAGGACGCGCGCCAGTCCTGGAACGCTATCGGCAAAATTAAAGATTTGAAAGAAGGTTACCTCTCCAAAGCAGTGCACGAAATTACCAAAATGGCAATCCAGTACAATGCTATTGTGGTAATGGAGGAGTTAAATTACGGATTTAAGCGCGGGAGGTTCAAGGTTGAAAAGCAAATTTACCAAAAATTTGAGAACATGTTGATTGATAAGATGAACTACCTGGTGTTCAAGGACGCACCTGACGAGTCGCCAGGCGGCGTGTTAAATGCATATCAGCTGACAAATCCACTGGAGAGCTTTGCCAAGCTAGGAAAGCAGACTGGCATTCTCTTTTACGTCCCTGCAGCGTATACATCCAAAATTGACCCCACCACTGGCTTCGTCAATCTGTTTAACACCTCCTCCAAAACCAACGCACAAGAACGGAAAGAATTTTTGCAAAAGTTTGAGTCCATTAGCTACTCTGCCAAAGACGGCGGGATCTTTGCTTTCGCATTCGACTACAGGAAATTCGGGACGAGTAAGACAGACCACAAGAACGTCTGGACCGCGTACACTAATGGGGAACGCATGCGCTACATCAAAGAGAAAAAGAGGAATGAACTTTTTGACCCTTCAAAGGAAATCAAGGAAGCTCTCACCTCAAGCGGTATCAAATACGATGGCGGGCAGAATATTTTGCCAGATATCCTCAGATCGAACAATAATGGACTTATCTATACTATGTACTCCTCCTTCATTGCAGCAATTCAAATGAGAGTGTACGATGGAAAGGAGGATTACATTATATCGCCAATTAAGAACTCCAAAGGCGAATTCTTCCGCACGGATCCTAAGCGAAGAGAACTCCCAATCGACGCTGATGCGAACGGCGCCTATAATATAGCCCTGCGGGGTGAATTAACAATGCGCGCTATTGCCGAGAAGTTCGACCCCGATTCAGAAAAAATGGCTAAGCTTGAGCTGAAACACAAAGATTGGTTCGAATTCATGCAGACAAGAGGCGACTAA SEQATGACTAAGACCTTCGATTCCGAGTTCTTCAACCTTTATTCCCTGCAGAAAACTGTAAGGTTTGAGCTGAID AGCCGGTGGGCGAGACAGCCAGCTTCGTAGAGGATTTCAAGAATGAGGGTCTCAAACGGGTAGTTAGTGNO: 
AGGATGAGAGGAGAGCAGTGGACTATCAGAAGGTGAAAGAGATCATCGATGACTATCACCGGGATTTC150ATAGAGGAGTCGTTGAATTACTTCCCTGAGCAAGTATCCAAAGACGCGCTGGAACAGGCCTTTCATCTTTACCAGAAACTGAAGGCAGCGAAGGTTGAGGAGCGGGAAAAGGCCTTGAAAGAGTGGGAAGCCCTGCAGAAAAAGCTCAGAGAAAAGGTTGTCAAATGCTTCAGCGACAGCAACAAAGCCAGGTTCAGTAGGATCGATAAGAAAGAACTGATCAAAGAAGACTTGATCAATTGGCTGGTTGCACAGAACCGGGAAGATGATATTCCCACCGTAGAGACCTTCAACAACTTCACAACTTACTTCACCGGCTTCCATGAGAATCGTAAAAACATCTACAGTAAAGATGATCATGCAACCGCCATCTCCTTCCGGTTGATCCACGAGAATCTCCCCAAGTTCTTTGACAACGTGATAAGTTTCAATAAGTTGAAAGAGGGATTTCCCGAACTCAAGTTCGATAAAGTGAAGGAGGATCTGGAAGTGGATTATGACCTTAAGCACGCTTTCGAGATAGAGTACTTCGTGAACTTTGTGACTCAGGCCGGCATCGATCAGTATAACTACCTCCTCGGGGGTAAGACGCTCGAGGACGGTACTAAGAAGCAAGGAATGAATGAGCAAATTAATCTATTTAAACAGCAGCAGACCAGGGATAAGGCTAGACAGATCCCCAAGCTTATTCCTCTTTTTAAACAGATCCTAAGTGAAAGGACAGAAAGTCAAAGCTTCATACCTAAGCAATTTGAAAGTGATCAGGAGCTGTTTGACTCCCTGCAAAAGCTGCACAACAATTGCCAGGACAAGTTTACCGTGCTGCAGCAGGCTATCCTCGGACTGGCTGAGGCGGATCTTAAGAAGGTATTCATTAAGACTAGCGACCTCAATGCCCTTAGTAACACCATCTTTGGAAATTACTCCGTTTTCAGCGATGCCCTCAATCTATACAAAGAGAGCTTGAAGACTAAAAAAGCTCAGGAAGCTTTTGAAAAATTACCGGCACATTCTATACACGACCTTATACAATACTTAGAGCAGTTCAACAGCAGCCTCGACGCTGAGAAACAGCAATCCACAGACACCGTCCTGAATTACTTCATCAAAACCGATGAACTGTACTCCCGATTTATCAAGAGCACTTCAGAAGCCTTCACGCAAGTTCAGCCTCTGTTCGAGCTGGAGGCACTGTCCAGCAAGAGACGACCGCCAGAGTCTGAAGACGAGGGAGCCAAGGGTCAAGAGGGGTTTGAACAGATAAAGCGAATTAAGGCTTACTTGGATACTCTCATGGAGGCGGTGCATTTCGCTAAGCCTTTGTACCTGGTTAAAGGCCGAAAAATGATTGAGGGGCTAGATAAGGATCAGTCTTTTTACGAGGCTTTTGAAATGGCCTACCAGGAATTGGAATCCTTGATCATTCCAATCTATAATAAAGCCCGGAGTTATCTGAGCAGGAAGCCCTTCAAAGCCGACAAGTTCAAAATAAATTTTGACAATAATACGCTACTGTCTGGTTGGGACGCTAACAAGGAAACAGCCAATGCTTCCATCCTGTTTAAGAAAGACGGCCTGTACTACCTGGGAATTATGCCAAAAGGCAAAACTTTTTTGTTCGATTACTTTGTGTCATCAGAGGATAGCGAGAAGTTAAAGCAAAGACGGCAGAAGACCGCCGAAGAAGCCCTCGCACAAGACGGAGAATCATATTTCGAGAAAATTCGATATAAGCTCCTGCCTGGCGCATCAAAGATGTTGCCAAAAGTCTTCTTTTCCAACAAAAACATCGGCTTTTATAACCCCAGCGATGATATCCTTCGCATCCGGAACACCGCCTCACATACCAAAAATGGAACTCCACAGAAGGGCCACTCGAAGGTTGAATTCAACCTTAACGATTGTCACAAAATGATTGATTTTTTTAAG
AGCTCCATTCAGAAACACCCCGAATGGGGGTCCTTTGGCTTCACCTTTTCTGATACTTCAGACTTCGAGGACATGTCCGCCTTCTACAGGGAGGTGGAGAACCAGGGCTATGTCATCTCCTTCGACAAAATAAAAGAGACATACATTCAGAGCCAGGTCGAGCAGGGAAATCTGTACCTGTTTCAGATCTATAACAAGGATTTCAGTCCCTATAGCAAGGGCAAGCCCAATTTACATACCCTGTACTGGAAGGCCCTGTTCGAAGAGGCAAACCTTAACAATGTAGTTGCTAAGCTGAATGGGGAAGCAGAGATCTTCTTCCGAAGGCACAGCATCAAGGCAAGCGACAAAGTTGTACATCCTGCTAACCAGGCCATCGATAACAAGAACCCGCATACAGAAAAGACACAGTCAACCTTTGAATACGACCTCGTGAAGGACAAGAGGTACACACAAGATAAATTCTTCTTCCACGTGCCCATCAGCTTGAATTTTAAAGCGCAGGGAGTGAGCAAATTTAACGACAAGGTCAACGGCTTCCTGAAGGGAAACCCCGACGTGAATATCATCGGAATTGATCGCGGTGAAAGACATCTCCTCTACTTTACTGTGGTGAACCAGAAGGGTGAGATCCTAGTACAGGAGAGCCTGAACACCCTTATGAGTGATAAGGGCCATGTGAATGATTACCAGCAGAAGCTGGACAAGAAGGAACAGGAAAGGGACGCAGCGCGGAAGTCCTGGACCACTGTTGAGAATATCAAAGAACTGAAGGAGGGATATCTTAGCCATGTGGTACACAAACTTGCACATCTGATTATCAAGTATAATGCCATAGTCTGCCTGGAAGACTTGAACTTCGGTTTCAAGCGAGGAAGGTTTAAAGTGGAGAAGCAGGTGTACCAGAAGTTTGAGAAAGCCCTTATTGATAAGCTAAACTACCTTGTCTTTAAGGAAAAAGAACTCGGCGAAGTTGGCCACTATTTAACCGCCTACCAACTAACCGCCCCTTTCGAGTCTTTTAAGAAACTGGGAAAGCAGAGCGGAATACTCTTCTATGTGCCTGCAGACTACACCTCTAAGATCGACCCCACTACCGGCTTTGTAAACTTTCTAGATCTCCGCTATCAGTCAGTAGAAAAAGCCAAACAGCTCTTGTCAGATTTTAACGCCATCCGATTTAATTCCGTCCAAAATTACTTCGAGTTCGAAATCGACTATAAAAAACTTACCCCCAAGAGAAAGGTTGGGACGCAGTCTAAGTGGGTAATCTGCACTTACGGTGACGTGAGATACCAGAACCGCCGAAACCAGAAAGGTCATTGGGAAACCGAGGAAGTGAATGTGACTGAGAAGCTCAAGGCCCTCTTCGCTAGCGACAGTAAAACAACAACAGTTATCGATTACGCCAATGACGATAATCTTATAGACGTGATCTTGGAACAAGACAAAGCCTCTTTTTTTAAGGAATTGTTGTGGTTGCTGAAACTTACAATGACCCTTAGGCACAGCAAGATCAAATCAGAGGATGACTTCATCCTCAGCCCGGTGAAGAATGAACAGGGAGAGTTCTACGATTCACGGAAGGCTGGAGAGGTGTGGCCCAAGGATGCCGACGCGAACGGGGCCTACCACATAGCTCTAAAAGGTCTGTGGAACCTGCAACAAATCAATCAATGGGAGAAAGGTAAGACACTGAACCTGGCCATCAAAAATCAAGATTGGTTCTCATTCATCCAGGAAAAGCCTTATCAAGAGTGA SEQATGCATACGGGAGGCCTTTTATCAATGGACGCAAAAGAGTTCACCGGGCAGTATCCATTATCTAAGACA 
IDCTCCGCTTCGAGCTGAGGCCCATTGGCAGGACCTGGGACAACCTGGAGGCGTCGGGCTACCTGGCTGAGNO:GACAGACATCGCGCAGAATGCTATCCGAGAGCTAAGGAGCTTTTGGACGACAATCATCGCGCGTTCCTT151AACCGGGTGCTCCCACAGATCGATATGGACTGGCACCCGATCGCTGAGGCTTTTTGCAAGGTCCATAAGAACCCTGGGAACAAAGAGCTCGCCCAGGACTACAACTTGCAGCTGAGCAAGCGACGGAAAGAGATTTCTGCCTACCTTCAAGACGCCGATGGCTACAAAGGGCTCTTCGCAAAGCCCGCATTGGATGAGGCCATGAAAATCGCCAAGGAGAACGGGAATGAAAGTGACATCGAAGTTCTCGAAGCGTTTAACGGATTTAGCGTGTACTTTACCGGCTATCATGAGTCAAGGGAGAATATTTATAGCGATGAGGACATGGTCTCTGTGGCCTACCGGATTACCGAGGATAATTTCCCGAGGTTTGTTTCAAATGCACTAATATTCGACAAGTTAAATGAGAGCCACCCAGACATCATCTCGGAGGTCAGCGGCAACCTCGGAGTTGACGATATTGGCAAATACTTCGACGTGAGCAACTATAACAACTTCCTCTCACAGGCTGGCATCGACGACTATAATCATATTATAGGCGGCCACACTACTGAGGATGGTCTCATTCAGGCATTCAATGTAGTCTTGAATCTTAGGCACCAGAAGGACCCTGGGTTTGAAAAGATACAGTTCAAGCAGCTGTATAAGCAGATATTATCCGTGCGAACATCTAAAAGTTACATCCCCAAACAGTTTGATAACTCAAAGGAGATGGTGGATTGCATATGCGATTATGTGTCAAAAATTGAAAAGAGCGAGACTGTGGAGCGGGCTCTGAAGCTCGTCAGGAACATTAGCTCCTTTGACCTTAGAGGAATTTTCGTCAATAAAAAGAATCTGAGGATCCTGAGCAATAAGCTAATAGGAGATTGGGACGCCATAGAGACAGCATTGATGCATTCCAGCTCAAGCGAGAATGATAAGAAGTCTGTCTACGATAGCGCTGAAGCCTTCACGCTGGACGATATCTTCTCTTCCGTGAAAAAATTTAGTGATGCGTCCGCAGAAGATATCGGGAATCGAGCCGAAGATATCTGCAGGGTAATTTCAGAGACCGCCCCTTTCATCAATGACCTGCGCGCCGTGGACCTGGATAGCCTGAATGACGATGGTTACGAAGCTGCAGTTTCTAAGATCAGGGAGTCTCTGGAGCCATATATGGACTTGTTTCACGAACTTGAGATCTTTAGCGTGGGCGACGAGTTCCCGAAATGCGCAGCTTTCTATAGCGAGTTAGAGGAGGTCAGCGAGCAATTAATCGAGATCATACCCCTGTTTAATAAGGCACGGAGCTTTTGTACTCGCAAGCGCTACAGCACCGACAAGATTAAAGTTAATCTGAAATTTCCAACTCTCGCAGACGGGTGGGACCTAAACAAGGAACGCGATAATAAAGCCGCCATCCTTAGAAAGGACGGAAAGTACTATCTTGCCATCCTAGATATGAAAAAAGATCTGAGTTCCATTCGTACTAGCGATGAAGACGAATCTTCTTTCGAAAAAATGGAGTATAAGCTGCTCCCCTCGCCAGTCAAGATGCTACCCAAGATCTTTGTGAAGAGCAAAGCAGCCAAGGAAAAGTACGGGCTGACGGACAGGATGCTGGAGTGCTACGATAAGGGAATGCATAAATCAGGGTCAGCTTTTGACTTGGGCTTTTGCCATGAGCTAATCGATTACTACAAGCGCTGTATCGCCGAGTATCCAGGATGGGACGTTTTCGACTTTAAATTTCGGGAGACTTCTGATTATGGTTCAATGAAGGAGTTCAACGAAGATGTCGCTGGTGCCGGTTACTACATGAGCCTTCGCAAGATTCCTTGTTCCGAAGTCTACCGGCTACTGGACGAGAAA
TCTATATATTTGTTCCAGATATATAACAAGGACTACAGTGAGAATGCACATGGGAATAAGAATATGCATACTATGTATTGGGAAGGTCTCTTTTCACCCCAAAATTTGGAGTCACCCGTGTTCAAACTTAGCGGTGGCGCAGAGCTGTTCTTTAGGAAATCCAGTATACCCAATGACGCCAAGACAGTCCACCCAAAGGGTAGCGTCCTGGTGCCCAGAAACGATGTGAACGGCAGGAGAATCCCTGACAGCATTTACCGAGAACTTACCAGGTACTTCAACCGCGGCGACTGTAGAATCTCTGATGAGGCAAAGTCTTATCTGGATAAGGTGAAGACTAAGAAGGCAGATCATGACATTGTGAAAGACCGCCGCTTTACTGTCGACAAAATGATGTTTCACGTGCCTATCGCAATGAATTTTAAGGCAATCTCAAAACCGAATCTGAACAAGAAGGTGATAGATGGCATTATCGATGACCAGGACCTCAAGATCATCGGAATCGACAGAGGTGAGCGAAACCTGATATACGTCACAATGGTAGATCGGAAGGGTAATATTCTGTACCAGGATTCACTAAACATCCTCAATGGATATGACTATCGAAAAGCTCTCGATGTCAGGGAATACGACAACAAGGAGGCGCGACGGAATTGGACAAAGGTGGAAGGCATACGGAAGATGAAGGAAGGCTATCTGTCACTAGCTGTCTCCAAATTGGCTGATATGATTATAGAGAACAACGCCATTATCGTGATGGAAGATCTCAACCATGGATTCAAGGCAGGAAGAAGTAAAATTGAGAAGCAGGTGTATCAGAAGTTCGAAAGCATGCTTATTAATAAGTTGGGTTATATGGTCTTAAAGGACAAGTCTATCGATCAGAGCGGCGGCGCACTCCATGGGTATCAGCTGGCTAACCATGTCACCACACTAGCATCCGTAGGCAAACAGTGTGGCGTGATTTTCTACATTCCTGCTGCGTTCACTTCTAAGATCGATCCTACCACGGGATTCGCAGACCTGTTCGCACTGAGCAATGTTAAAAACGTGGCCTCCATGAGGGAGTTCTTTAGCAAAATGAAAAGCGTGATTTATGACAAGGCCGAGGGCAAGTTCGCTTTCACATTTGACTACCTGGACTACAATGTGAAATCAGAGTGCGGGAGAACCCTGTGGACCGTATACACGGTAGGGGAAAGATTCACTTACAGTCGAGTTAATCGGGAGTATGTCCGTAAAGTGCCAACTGACATCATCTACGATGCCCTTCAGAAGGCTGGCATAAGTGTTGAGGGGGATCTAAGGGACAGGATCGCTGAATCGGATGGCGATACTCTCAAATCAATCTTCTACGCCTTCAAGTATGCCCTCGACATGAGGGTAGAGAACCGGGAGGAGGACTATATACAGTCTCCCGTGAAGAATGCGTCGGGAGAGTTCTTCTGCTCAAAAAACGCCGGGAAATCTTTGCCGCAGGATTCTGATGCAAATGGGGCTTATAACATTGCTCTCAAAGGCATCCTGCAGCTGCGCATGCTATCTGAACAATATGACCCAAACGCTGAAAGCATTAGATTGCCATTGATCACCAATAAGGCTTGGCTGACTTTCATGCAGAGCGGTATGAAGACATGGAAAAACTAA SEQATGGATTCCCTTAAGGACTTCACAAATCTTTACCCCGTGAGTAAAACCCTGAGATTTGAACTCAAGCCCGID 
TGGGAAAGACTCTCGAGAATATCGAGAAGGCCGGGATTTTGAAGGAAGACGAGCATCGGGCGGAAAGTNO:TACAGACGGGTGAAGAAGATTATAGATACTTATCACAAGGTCTTTATAGACAGCTCTTTAGAGAACATG152GCAAAGATGGGCATCGAGAACGAAATCAAGGCCATGCTGCAGTCCTTCTGCGAGCTGTATAAAAAGGATCATCGGACCGAAGGCGAAGACAAGGCGCTGGATAAGATCAGGGCAGTGCTGCGCGGCCTCATTGTGGGTGCCTTCACTGGGGTGTGCGGGCGGAGAGAGAACACTGTGCAGAATGAGAAATACGAGAGTTTGTTCAAAGAGAAACTCATCAAGGAAATCCTGCCCGACTTCGTCTTAAGCACAGAAGCCGAATCTCTCCCATTTTCTGTCGAGGAGGCCACGCGTTCCCTTAAAGAGTTCGACAGTTTCACTTCATACTTTGCCGGATTTTATGAAAACCGTAAAAATATATACTCCACTAAACCACAGTCAACTGCAATAGCTTACAGGTTAATCCACGAAAACCTGCCAAAATTCATCGACAATATACTCGTCTTTCAAAAAATCAAGGAACCAATCGCGAAGGAACTTGAACACATCCGGGCTGACTTTAGTGCGGGAGGATACATCAAAAAAGACGAGCGCCTGGAGGATATATTTTCACTAAATTATTATATTCATGTACTGAGCCAGGCTGGCATAGAAAAGTACAACGCTCTAATTGGGAAAATCGTGACAGAAGGTGACGGGGAAATGAAAGGGCTAAACGAACATATTAACTTATATAACCAACAGCGGGGTCGAGAAGATCGTCTGCCCCTGTTCAGACCTCTGTATAAGCAAATACTCTCCGACAGAGAGCAGCTATCATATCTGCCCGAGTCCTTTGAGAAAGATGAAGAGCTGCTCCGGGCGCTCAAGGAGTTCTATGATCATATAGCCGAGGACATTTTGGGCAGAACTCAGCAACTCATGACGTCTATTTCTGAATATGATCTGTCTCGTATCTATGTCAGGAATGATAGCCAGCTGACCGATATATCCAAGAAGATGCTGGGGGACTGGAACGCCATTTATATGGCGAGGGAGCGAGCATACGATCACGAGCAGGCACCCAAGAGAATCACAGCCAAATATGAGAGAGACCGCATTAAGGCGCTGAAGGGCGAAGAAAGTATCAGTCTGGCCAATCTGAACTCCTGCATAGCTTTCCTTGATAACGTGAGGGATTGCAGAGTTGATACTTACCTGAGTACCCTGGGCCAGAAGGAAGGGCCTCACGGCCTCTCTAATCTAGTGGAGAATGTATTTGCCTCCTACCACGAAGCTGAGCAGCTGCTGTCATTTCCGTACCCAGAGGAAAATAATTTAATACAGGATAAGGACAACGTAGTGCTTATCAAAAATCTACTGGATAACATTTCCGACCTCCAGCGCTTTCTCAAACCACTTTGGGGGATGGGCGACGAGCCTGATAAGGATGAGCGCTTTTACGGCGAGTACAACTACATCAGGGGCGCCTTGGACCAGGTGATTCCCCTCTATAATAAAGTCAGGAATTACCTGACCCGAAAGCCATACAGTACAAGAAAGGTGAAATTAAATTTCGGCAATAGTCAGCTGCTGTCTGGTTGGGACCGAAATAAGGAGAAAGACAACAGCTGCGTAATTCTCAGAAAAGGACAGAACTTTTATTTGGCCATCATGAATAACAGACACAAGAGATCTTTCGAGAACAAAGTGCTCCCTGAGTATAAGGAGGGGGAACCCTACTTCGAGAAGATGGACTATAAATTCCTTCCTGATCCAAATAAAATGCTGCCTAAAGTATTTCTGTCAAAAAAAGGTATAGAAATCTACAAACCTTCACCTAAGCTACTTGAACAGTATGGCCACGGCACCCATAAAAAAGGGGACACGTTCAGCATGGACGACCTACACGAACTGATTGACTTCTTTAAGCACAGC
ATAGAAGCTCATGAGGACTGGAAACAGTTCGGATTCAAATTCTCAGATACCGCGACCTACGAAAACGTGTCTAGTTTTTACCGGGAAGTCGAGGACCAGGGCTACAAGCTCAGCTTCAGAAAAGTTAGCGAATCTTACGTCTACTCCCTTATAGATCAAGGTAAGCTGTATCTCTTTCAAATCTACAACAAGGACTTTTCCCCATGTAGCAAGGGCACCCCCAATCTGCACACTCTCTACTGGCGGATGCTGTTCGACGAGCGTAACCTGGCAGACGTGATCTACAAATTAGATGGTAAAGCTGAGATCTTCTTTCGTGAAAAGAGCCTAAAGAACGATCACCCCACTCACCCCGCCGGAAAGCCCATTAAGAAGAAAAGTAGGCAGAAGAAAGGAGAAGAATCGCTATTTGAGTACGACCTCGTCAAGGATCGGCATTATACAATGGATAAGTTCCAGTTCCATGTGCCAATAACTATGAATTTCAAGTGCAGTGCTGGCAGTAAGGTGAATGACATGGTAAACGCTCATATCCGGGAGGCAAAGGACATGCATGTTATTGGAATTGATAGGGGTGAGCGTAATCTCCTCTACATCTGTGTTATTGACTCCCGCGGCACAATCCTCGATCAGATTTCCTTGAATACAATTAATGATATAGACTACCATGACTTGCTTGAGTCTCGCGACAAAGATAGACAGCAGGAGAGAAGAAATTGGCAGACCATCGAAGGCATCAAGGAACTCAAGCAAGGCTACCTTTCTCAGGCAGTGCATCGAATAGCCGAGCTGATGGTGGCTTATAAAGCCGTCGTGGCACTAGAAGACCTAAATATGGGATTTAAACGAGGCAGGCAGAAGGTGGAATCATCCGTATACCAGCAGTTCGAAAAACAGTTGATAGACAAACTCAATTACCTTGTAGACAAGAAGAAGCGGCCTGAGGACATAGGGGGCCTGCTTAGAGCGTATCAATTTACAGCCCCATTCAAGTCTTTCAAAGAAATGGGTAAACAGAACGGTTTTCTGTTTTACATCCCAGCGTGGAACACCAGCAATATAGATCCAACCACTGGCTTCGTCAATCTGTTTCATGCTCAGTATGAAAATGTGGACAAGGCCAAATCCTTCTTTCAGAAATTTGACAGCATCTCCTATAACCCAAAGAAAGACTGGTTTGAATTCGCCTTTGACTATAAGAATTTCACTAAGAAGGCCGAGGGATCAAGAAGCATGTGGATATTGTGCACGCATGGCTCACGTATAAAGAACTTTAGAAACTCGCAAAAAAACGGGCAGTGGGACTCAGAAGAATTCGCACTCACCGAGGCTTTCAAATCCCTCTTCGTCCGGTATGAGATCGATTACACCGCCGATCTGAAGACGGCAATCGTCGACGAGAAACAGAAAGACTTCTTTGTAGATCTACTTAAGCTCTTTAAGCTAACCGTTCAGATGCGAAACAGTTGGAAAGAAAAGGATCTCGACTATCTCATTAGTCCAGTGGCTGGCGCGGATGGTAGATTTTTCGATACCCGGGAAGGTAACAAGTCCCTTCCCAAAGACGCCGACGCGAATGGTGCCTACAATATTGCACTAAAGGGGCTCTGGGCGCTGCGGCAAATTAGACAGACATCTGAAGGGGGCAAGCTTAAGCTGGCTATTTCTAATAAAGAGTGGTTGCAGTTTGTGCAGGAAAGGAGTTATGAGAAGGACTAG SEQATGAACAACGGCACCAACAACTTCCAGAACTTCATCGGCATATCGTCTCTGCAGAAAACACTTAGGAAT 
IDGCCCTGATTCCAACTGAGACAACACAGCAGTTTATTGTGAAGAATGGGATCATCAAAGAGGACGAATTGNO:CGCGGGGAGAATAGGCAGATCCTGAAGGACATCATGGACGATTACTACAGGGGTTTTATCTCCGAAACG153CTGAGCTCGATTGACGATATTGACTGGACGTCCCTCTTTGAGAAGATGGAAATCCAACTTAAAAATGGCGATAATAAAGATACCCTGATAAAGGAACAAACCGAATATAGAAAGGCTATACACAAAAAATTCGCAAATGACGACCGCTTTAAGAACATGTTTTCTGCAAAACTGATTAGCGATATTCTGCCCGAGTTTGTGATTCACAATAATAACTATTCCGCTTCGGAGAAGGAGGAAAAGACTCAGGTGATTAAACTGTTTTCTCGGTTCGCCACTTCTTTCAAAGATTATTTCAAAAATCGCGCCAACTGTTTTTCCGCTGACGACATCTCCTCCTCTTCCTGCCACCGGATCGTAAACGACAATGCCGAGATCTTTTTTAGTAACGCCCTTGTGTATCGGAGGATAGTGAAGAGCCTGTCCAATGATGACATAAACAAAATTTCTGGCGATATGAAGGATAGCCTCAAAGAGATGAGCCTTGAAGAAATTTACTCCTACGAGAAGTATGGGGAGTTCATCACCCAGGAGGGGATTTCCTTCTATAATGACATCTGTGGCAAGGTGAACAGCTTCATGAACCTGTACTGCCAGAAGAATAAGGAAAACAAAAATCTGTACAAGCTTCAGAAGTTACATAAGCAGATCCTGTGTATCGCGGATACCTCATATGAGGTTCCTTATAAGTTCGAGAGTGATGAAGAAGTGTACCAGTCTGTAAATGGATTCTTAGACAATATTTCGTCCAAACATATAGTGGAGAGACTGAGAAAGATCGGGGACAATTACAATGGGTACAATCTCGACAAGATTTATATCGTGTCGAAGTTTTACGAATCTGTGAGCCAGAAAACATACAGGGATTGGGAAACCATTAATACCGCGCTTGAAATTCACTACAATAATATTCTGCCTGGCAACGGAAAAAGCAAGGCCGATAAGGTAAAAAAGGCAGTCAAAAATGACCTTCAGAAAAGTATCACCGAAATCAATGAGTTGGTGAGCAACTACAAATTGTGTTCAGACGATAATATTAAAGCGGAAACGTACATACATGAAATTAGCCATATTCTGAATAACTTTGAGGCGCAGGAACTTAAGTACAACCCTGAAATTCATCTCGTCGAAAGCGAATTGAAGGCCTCTGAATTGAAAAACGTTCTTGACGTGATAATGAACGCTTTCCATTGGTGCTCTGTGTTTATGACTGAAGAGCTGGTTGATAAGGACAACAACTTTTATGCTGAACTTGAGGAAATCTACGACGAGATCTACCCTGTGATTAGCTTGTATAACCTCGTCAGAAACTACGTTACCCAGAAGCCGTACAGCACGAAAAAAATAAAGCTGAACTTTGGTATTCCGACTCTCGCCGATGGATGGAGCAAGTCGAAGGAATATTCCAACAATGCCATCATTCTTATGCGAGACAATCTGTATTACCTCGGCATCTTTAACGCCAAAAACAAGCCGGATAAGAAAATCATTGAAGGGAATACGAGCGAGAATAAGGGCGACTATAAGAAAATGATCTACAACTTACTGCCAGGTCCCAATAAAATGATTCCTAAGGTGTTTCTGTCATCGAAAACAGGTGTAGAAACATATAAGCCCAGCGCATACATCCTGGAAGGCTACAAGCAAAACAAACACATCAAAAGCAGCAAGGACTTTGATATCACATTCTGCCACGATCTAATCGACTACTTCAAAAATTGCATCGCCATTCACCCTGAGTGGAAGAACTTCGGCTTTGACTTCTCCGACACCAGTACCTACGAAGACATTTCTGGATTCTACCGTGAGGTTGAGCTGCAGGGTTATAAAATTGACTGGACATACATCAGTGAA
AAAGACATCGATCTACTGCAGGAGAAGGGGCAGCTCTATCTCTTCCAGATTTATAATAAGGATTTCAGCAAGAAGTCCACTGGAAACGACAATCTGCATACAATGTATCTTAAGAACTTGTTTAGCGAAGAGAATTTGAAAGATATCGTTCTAAAGTTAAACGGGGAAGCCGAGATTTTCTTTCGAAAGTCTTCCATTAAGAATCCAATTATTCACAAGAAGGGCAGTATCCTGGTCAACAGAACCTATGAGGCCGAGGAAAAGGACCAGTTCGGTAATATACAAATTGTGCGCAAGAACATCCCCGAGAACATTTACCAGGAGCTCTATAAATACTTCAACGACAAAAGCGATAAGGAGCTTTCCGACGAGGCTGCCAAGCTGAAAAACGTGGTGGGACACCATGAAGCAGCCACCAACATCGTCAAAGATTATCGTTATACATATGACAAATATTTTCTGCACATGCCTATTACAATAAACTTTAAGGCAAACAAGACCGGGTTCATCAATGACCGGATACTCCAGTACATCGCAAAAGAGAAGGACCTGCATGTGATCGGCATCGACCGCGGTGAAAGAAATCTCATTTACGTCAGCGTTATCGACACTTGTGGAAACATTGTGGAGCAGAAGTCCTTCAACATTGTTAACGGCTATGACTATCAGATCAAGCTCAAACAGCAGGAAGGTGCTCGTCAGATTGCGAGGAAAGAATGGAAAGAGATCGGCAAGATCAAGGAGATCAAAGAAGGGTATCTGAGCTTGGTCATTCACGAGATCTCCAAAATGGTCATCAAGTACAACGCTATTATCGCGATGGAAGACCTCTCTTACGGCTTTAAGAAGGGGCGCTTTAAAGTGGAGCGCCAGGTCTATCAGAAGTTCGAGACTATGCTTATCAATAAGCTGAATTACTTGGTCTTTAAGGATATCAGTATCACCGAGAACGGAGGACTGCTGAAAGGTTACCAGCTCACATATATTCCCGATAAGCTCAAGAATGTGGGCCACCAATGCGGTTGTATTTTTTACGTTCCAGCTGCCTACACATCTAAGATCGATCCTACCACCGGATTCGTCAATATATTTAAATTTAAAGATCTAACCGTTGATGCCAAGCGTGAGTTTATTAAGAAATTTGATTCAATCAGGTACGACAGCGAAAAGAACCTCTTCTGTTTCACTTTCGACTACAACAACTTCATCACACAAAATACTGTGATGAGCAAGTCATCATGGAGCGTTTATACTTATGGTGTAAGGATAAAAAGGCGCTTTGTTAATGGAAGGTTTTCCAATGAAAGCGATACAATAGACATCACAAAAGACATGGAGAAGACACTGGAGATGACAGATATTAATTGGAGGGACGGGCATGACCTTAGACAGGACATCATCGACTACGAAATCGTCCAACACATTTTTGAGATATTCAGACTCACTGTCCAGATGCGAAACAGCCTGTCGGAACTCGAAGACCGGGACTACGATAGACTGATCTCCCCGGTGTTAAACGAAAATAATATTTTCTACGATTCTGCTAAGGCAGGAGACGCTCTTCCTAAAGATGCGGACGCCAATGGCGCTTACTGTATAGCGTTGAAGGGATTGTATGAGATTAAACAGATCACTGAGAATTGGAAAGAAGACGGTAAATTCTCCAGAGACAAGCTGAAAATCTCCAACAAAGACTGGTTTGATTTTATTCAAAATAAGCGCTACCTGTAA SEQATGACAAACAAATTTACTAATCAGTACAGCCTGTCAAAGACCCTCCGCTTCGAACTGATTCCACAAGGG IDAAGACCCTTGAATTCATCCAGGAAAAGGGTTTATTATCCCAGGATAAACAACGCGCAGAAAGCTATCAANO:GAGATGAAGAAGACGATCGATAAATTTCATAAGTATTTCATAGATTTAGCCCTGAGCAACGCTAAATTG154 
ACCCACCTGGAAACCTATTTGGAGCTGTACAACAAGTCAGCCGAGACAAAGAAAGAGCAGAAGTTTAAGGACGACCTGAAAAAAGTACAGGACAATTTGCGAAAAGAGATCGTCAAGTCTTTTTCCGACGGAGACGCCAAGTCAATATTTGCCATCCTGGACAAAAAGGAACTCATCACTGTGGAGTTGGAGAAGTGGTTTGAGAATAATGAGCAGAAGGACATCTATTTTGACGAAAAGTTCAAGACATTTACTACTTACTTCACCGGATTTCACCAAAACCGGAAGAACATGTACTCTGTTGAGCCGAACTCAACCGCCATCGCCTACCGCCTTATTCACGAAAATCTGCCAAAGTTTCTCGAGAATGCTAAAGCCTTTGAGAAAATTAAGCAGGTCGAGTCGCTCCAGGTGAACTTTCGAGAGCTGATGGGTGAATTCGGGGACGAGGGCCTGATTTTCGTGAATGAACTCGAAGAGATGTTTCAGATCAACTACTATAATGATGTACTCTCACAGAACGGGATCACTATCTACAACAGCATTATCTCTGGATTCACTAAGAACGATATCAAGTATAAAGGGCTGAATGAATACATCAACAATTATAATCAGACTAAGGACAAAAAGGACAGGCTGCCTAAATTGAAACAGCTGTATAAGCAGATCCTCAGTGATAGAATTAGCTTGTCATTTCTCCCAGATGCCTTCACTGACGGAAAGCAGGTGCTTAAGGCGATATTCGATTTCTATAAGATCAACCTCCTCTCTTATACAATCGAGGGCCAGGAGGAGTCACAGAACCTCCTGCTCCTGATTCGACAAACTATTGAAAATCTGTCCTCTTTCGATACGCAGAAGATATACCTGAAAAATGACACCCATCTCACTACAATATCCCAACAGGTATTCGGAGATTTCTCCGTCTTCAGTACAGCCCTGAATTACTGGTACGAGACAAAGGTGAACCCTAAGTTCGAAACAGAGTACAGCAAGGCGAACGAAAAGAAGAGGGAGATCCTGGACAAAGCCAAAGCCGTTTTCACCAAGCAAGATTACTTTAGCATCGCATTTCTGCAGGAAGTCCTGTCTGAGTACATACTGACACTCGATCACACAAGCGACATAGTTAAGAAGCACTCTTCCAATTGTATCGCGGACTACTTCAAAAATCATTTTGTCGCGAAAAAGGAGAACGAGACAGATAAGACCTTCGATTTTATCGCGAATATTACCGCAAAGTATCAATGCATTCAGGGTATCTTGGAGAACGCCGACCAGTACGAAGACGAGCTTAAACAGGATCAGAAGCTCATCGACAACCTAAAGTTCTTTTTGGACGCTATACTGGAACTCCTTCATTTTATTAAGCCACTACATCTGAAGAGTGAGTCTATCACTGAGAAGGACACTGCTTTTTACGACGTTTTCGAGAATTACTACGAAGCACTGTCTCTGCTAACCCCTCTGTATAACATGGTGAGAAACTATGTGACACAGAAACCTTATAGTACCGAGAAGATTAAGTTGAACTTCGAGAACGCACAATTGCTGAATGGGTGGGATGCAAACAAAGAGGGTGATTACCTCACAACAATCCTCAAGAAAGATGGCAATTACTTCCTGGCCATTATGGATAAAAAACATAACAAGGCATTTCAGAAATTTCCCGAGGGGAAGGAAAATTATGAAAAGATGGTATACAAGTTGCTGCCCGGGGTGAACAAAATGCTCCCGAAGGTGTTTTTCTCGAATAAGAATATCGCGTACTTTAACCCGTCCAAGGAACTGTTGGAAAATTATAAAAAGGAAACACACAAGAAGGGGGACACTTTTAATTTGGAGCACTGCCACACACTCATTGACTTCTTTAAAGATAGTCTCAACAAACATGAGGATTGGAAATATTTTGACTTTCAGTTTAGCGAGACCAAGTCTTATCAGGATCTGTCGGGATTTTATAGGGAAGTTGAGCACCAGGGTTACAAGATAAATTTCAA
GAACATCGATAGCGAGTACATTGACGGACTGGTGAACGAAGGGAAGCTGTTCCTGTTTCAGATTTACAGCAAAGATTTCTCTCCTTTCTCAAAAGGCAAGCCGAACATGCATACCCTGTATTGGAAGGCCCTGTTCGAGGAGCAAAACCTTCAGAATGTGATTTACAAGCTGAACGGTCAGGCCGAGATTTTTTTTAGGAAGGCCTCTATCAAGCCCAAAAACATCATTCTGCACAAGAAAAAGATAAAGATCGCCAAAAAACACTTCATTGATAAAAAGACAAAGACTTCTGAGATCGTACCTGTTCAGACAATCAAGAATCTCAACATGTATTATCAGGGGAAGATTAGCGAGAAAGAGCTGACACAGGACGATTTGAGGTACATCGACAACTTCTCTATCTTTAACGAGAAGAACAAGACAATCGATATCATCAAGGACAAGCGGTTTACCGTCGATAAATTCCAGTTCCATGTGCCTATCACGATGAATTTCAAGGCCACCGGTGGGAGTTATATCAACCAGACTGTGCTGGAGTATCTGCAGAACAACCCCGAAGTAAAAATTATTGGCCTGGACAGAGGAGAGCGGCATCTGGTGTACTTGACCCTCATCGATCAGCAGGGAAATATCCTGAAACAAGAATCTCTGAATACTATTACGGACTCCAAAATCAGCACACCTTACCACAAGCTGCTTGATAATAAAGAGAATGAGAGGGACTTGGCCCGCAAAAATTGGGGCACCGTCGAGAATATTAAGGAATTGAAAGAAGGATACATCTCACAGGTGGTTCACAAAATCGCAACCCTGATGTTAGAAGAGAACGCTATTGTGGTGATGGAGGACTTAAACTTCGGATTTAAAAGAGGAAGATTTAAAGTCGAGAAACAGATTTATCAGAAACTGGAAAAAATGCTCATTGACAAATTAAATTACCTGGTGCTGAAAGATAAACAGCCACAGGAGCTGGGTGGCCTGTATAATGCTCTGCAGCTGACCAACAAGTTCGAGTCGTTTCAGAAAATGGGCAAGCAGTCAGGCTTCCTTTTTTACGTGCCCGCTTGGAACACCTCAAAAATCGACCCTACAACAGGCTTTGTGAATTATTTCTATACCAAGTATGAAAACGTGGACAAGGCAAAGGCCTTTTTCGAGAAGTTTGAAGCAATCAGGTTCAATGCCGAGAAAAAATACTTTGAGTTCGAGGTCAAAAAATATAGCGACTTCAACCCTAAGGCCGAAGGCACGCAACAAGCCTGGACAATATGCACGTATGGGGAGAGAATTGAGACTAAGCGGCAGAAGGATCAGAATAACAAATTCGTGAGCACACCGATTAACCTGACAGAGAAGATAGAGGACTTCCTCGGCAAGAATCAGATCGTGTACGGCGACGGCAATTGCATCAAGTCACAAATTGCATCTAAAGATGACAAAGCATTCTTCGAAACACTGCTGTATTGGTTCAAGATGACACTCCAGATGCGAAATAGCGAAACAAGAACAGATATTGACTACCTCATCAGCCCTGTGATGAATGATAACGGCACGTTTTACAATTCCCGGGACTATGAAAAATTAGAGAACCCGACACTGCCAAAAGACGCCGACGCAAATGGTGCATATCACATCGCAAAGAAAGGTTTGATGCTGTTGAACAAAATTGATCAGGCTGATCTGACAAAAAAGGTCGATCTGAGTATCAGTAACCGCGACTGGTTGCAGTTTGTCCAGAAGAACAAATAA SEQATGGAACAAGAGTACTATCTGGGCCTGGACATGGGCACCGGGAGTGTCGGATGGGCAGTCACCGACTCA IDGAGTACCACGTCCTCAGAAAGCACGGTAAGGCACTTTGGGGAGTGCGACTCTTCGAGTCCGCTAGTACTNO: 
GCTGAAGAGAGGAGGATGTTTCGAACTTCCAGGCGCAGGCTGGATCGGCGAAACTGGAGAATAGAGAT155TCTCCAGGAGATATTTGCTGAAGAGATTTCAAAGAAGGATCCTGGTTTTTTCCTGCGCATGAAAGAATCTAAGTATTACCCCGAAGATAAACGCGACATCAACGGCAATTGTCCTGAACTGCCCTATGCTCTGTTTGTCGACGACGATTTCACCGACAAAGATTACCACAAGAAATTCCCCACCATATACCACCTGAGAAAGATGTTGATGAACACCGAGGAGACACCCGACATACGTCTGGTTTACCTGGCTATCCATCATATGATGAAGCACCGCGGGCATTTCCTGCTGTCTGGAGACATCAATGAGATAAAGGAATTTGGTACTACGTTCTCCAAGTTGTTAGAAAACATTAAGAATGAAGAGTTGGACTGGAATCTTGAACTGGGAAAGGAAGAGTATGCAGTTGTAGAGTCGATTTTGAAAGATAACATGTTAAACCGGTCAACTAAGAAAACCAGGTTAATTAAGGCACTAAAGGCCAAATCGATATGCGAGAAGGCTGTGCTAAATCTGCTGGCTGGAGGCACCGTGAAACTGTCTGATATTTTCGGCCTGGAAGAGCTCAATGAAACCGAGCGGCCTAAAATTTCTTTCGCCGATAACGGATACGATGACTATATTGGGGAGGTGGAAAACGAGCTCGGAGAACAATTCTACATTATTGAAACCGCTAAGGCAGTCTATGACTGGGCCGTGCTCGTCGAGATTTTAGGCAAGTACACCAGCATTAGCGAAGCAAAGGTGGCTACCTATGAAAAGCACAAATCTGACCTCCAGTTTCTGAAAAAGATTGTGCGCAAATACTTAACAAAAGAAGAGTACAAGGACATCTTTGTGAGCACATCAGATAAGCTCAAGAATTACTCAGCATACATTGGAATGACAAAGATTAACGGGAAGAAGGTGGATCTCCAAAGCAAACGTTGTTCAAAGGAGGAGTTTTACGATTTCATAAAGAAGAACGTGCTGAAGAAACTGGAGGGACAACCGGAGTACGAGTATTTAAAGGAGGAGCTCGAGCGAGAAACTTTCCTGCCCAAGCAAGTGAACAGAGACAATGGTGTCATTCCTTACCAGATTCACTTATATGAGCTGAAGAAAATCCTGGGGAACTTGAGAGACAAGATAGACCTCATCAAGGAAAATGAAGATAAGTTGGTCCAGTTGTTCGAATTCAGAATCCCATATTACGTCGGCCCGCTCAATAAGATCGACGACGGCAAGGAAGGCAAATTCACTTGGGCGGTGCGAAAAAGCAACGAAAAAATATACCCATGGAACTTTGAGAACGTCGTTGACATCGAGGCCAGCGCCGAGAAATTTATAAGACGCATGACTAATAAGTGTACTTACCTCATGGGCGAGGATGTTCTGCCCAAGGACAGCCTGCTGTATTCCAAGTACATGGTGCTTAACGAGCTGAATAATGTAAAGTTAGATGGTGAGAAGCTCAGCGTGGAGCTTAAACAGAGGCTGTACACTGATGTGTTTTGCAAGTATCGGAAAGTTACCGTTAAGAAGATAAAGAATTACCTGAAATGCGAAGGGATCATTTCCGGCAACGTGGAAATTACCGGAATCGACGGCGATTTTAAGGCGTCGTTGACCGCTTATCATGATTTCAAGGAGATTTTAACCGGCACGGAGCTCGCGAAGAAAGACAAGGAGAACATAATCACGAATATAGTTCTGTTTGGGGACGATAAAAAACTTCTTAAAAAACGACTCAATCGACTGTATCCGCAGATTACCCCCAACCAGCTGAAGAAGATTTGCGCTCTGAGCTATACCGGGTGGGGCCGGTTCTCTAAGAAATTCCTCGAGGAGATCACAGCACCAGACCCAGAGACTGGTGAGGTGTGGAATATTATTACAGCTCTGTGGGAATCCAATAATAACCTTATGCAATTGTTGAGCAATGAATA
TAGGTTCATGGAGGAAGTGGAAACCTACAATATGGGCAAGCAGACAAAGACCCTATCTTACGAGACCGTTGAGAATATGTATGTCTCCCCTTCAGTGAAACGGCAAATCTGGCAAACTTTGAAGATCGTGAAGGAGCTCGAAAAGGTGATGAAAGAGAGCCCGAAGAGGGTTTTTATTGAAATGGCCAGAGAGAAACAGGAGAGCAAGAGAACAGAGTCTAGGAAGAAGCAGCTAATCGATTTGTATAAAGCCTGCAAGAACGAGGAAAAAGACTGGGTCAAGGAGCTAGGCGATCAGGAAGAACAGAAGTTGCGCTCTGATAAGCTGTACTTATATTATACCCAGAAAGGACGGTGCATGTACTCAGGTGAGGTCATTGAGCTGAAAGATCTGTGGGACAATACTAAGTATGATATTGATCACATCTACCCTCAGTCAAAAACTATGGACGACTCCCTCAACAACAGGGTGTTGGTTAAGAAGAAATACAATGCTACAAAGTCCGATAAATACCCTCTTAACGAAAACATCCGGCACGAAAGAAAGGGCTTCTGGAAGTCCCTGCTGGATGGGGGTTTTATCAGTAAAGAAAAGTATGAGAGGCTGATCCGAAATACCGAGCTCTCCCCCGAGGAACTGGCTGGCTTTATCGAAAGGCAGATCGTAGAGACTAGGCAATCTACAAAGGCAGTCGCTGAGATCCTGAAGCAAGTGTTTCCTGAGTCAGAAATCGTGTACGTCAAAGCTGGCACAGTGTCACGGTTCCGAAAGGACTTTGAGTTGTTAAAAGTTCGGGAGGTGAATGACCTGCACCACGCTAAAGACGCCTATCTGAATATCGTTGTGGGGAACTCCTATTATGTTAAGTTTACTAAGAATGCGTCCTGGTTTATTAAGGAGAACCCGGGGCGCACCTATAACCTGAAGAAGATGTTCACCTCCGGCTGGAACATAGAACGGAACGGAGAAGTCGCGTGGGAGGTGGGTAAGAAAGGGACCATTGTGACCGTCAAACAGATTATGAACAAAAACAACATATTGGTAACTCGCCAGGTGCATGAGGCCAAAGGGGGCCTCTTTGATCAGCAGATTATGAAAAAGGGCAAAGGACAGATCGCAATCAAGGAAACCGACGAGCGCCTGGCATCCATTGAGAAGTACGGAGGCTACAACAAGGCGGCAGGTGCGTACTTCATGCTCGTCGAGTCCAAAGATAAGAAAGGCAAAACTATTAGAACAATCGAGTTCATCCCTCTATATTTGAAAAATAAGATCGAAAGTGACGAAAGCATCGCCCTTAACTTCTTGGAGAAGGGCCGGGGCTTAAAGGAACCAAAGATTCTGCTCAAGAAGATCAAGATCGACACACTCTTCGATGTGGATGGTTTTAAGATGTGGCTGTCAGGCAGGACAGGGGATCGCTTGCTGTTCAAATGCGCAAATCAGTTGATTCTGGACGAAAAGATCATTGTGACGATGAAGAAGATCGTTAAATTCATTCAGCGGAGACAGGAAAACAGAGAACTGAAACTCTCCGATAAGGATGGAATTGACAATGAAGTCCTCATGGAGATTTACAATACCTTTGTGGACAAGCTTGAGAACACAGTCTATCGGATCCGACTGTCCGAACAGGCAAAGACTCTGATCGACAAACAGAAAGAATTCGAAAGACTAAGCTTAGAGGACAAAAGTTCAACTCTCTTTGAAATTCTCCACATCTTCCAATGTCAAAGTAGTGCAGCCAACTTGAAGATGATCGGGGGTCCCGGCAAGGCTGGAATCTTAGTCATGAACAACAACATCTCCAAATGTAACAAAATCTCCATCATAAACCAGTCTCCCACCGGCATTTTCGAGAACGAAATTGATTTACTCAAG SEQATGAAATCTTTCGATTCTTTCACCAACCTCTACTCCCTTAGCAAAACCCTTAAGTTTGAAATGAGGCCGGID 
TGGGGAATACACAGAAGATGCTTGACAATGCTGGCGTCTTTGAAAAGGACAAATTAATCCAGAAGAAGTNO:ATGGTAAAACAAAGCCATATTTTGACCGATTGCATCGGGAATTCATTGAAGAGGCTCTTACAGGAGTAG156 AATTGATCGGACTGGACGAGAACTTCCGTACCTTAGTAGACTGGCAGAAGGACAAGAAGAACAACGTGGCAATGAAGGCCTATGAGAACTCACTCCAGCGCCTTAGAACCGAGATCGGAAAGATCTTTAATCTTAAGGCGGAAGATTGGGTAAAAAATAAGTACCCGATCCTGGGACTGAAAAACAAAAACACAGACATCCTGTTTGAAGAAGCCGTCTTTGGTATCTTGAAGGCCAGGTATGGAGAGGAGAAAGACACGTTTATAGAGGTAGAGGAGATTGATAAAACAGGCAAGAGTAAGATTAATCAGATCAGTATCTTTGATTCTTGGAAGGGGTTCACAGGCTACTTTAAGAAGTTTTTCGAAACCAGGAAAAATTTCTATAAGAACGATGGCACCTCCACAGCTATCGCGACACGCATCATAGATCAGAATCTGAAACGGTTCATTGATAATCTGAGCATTGTTGAATCCGTGCGCCAGAAGGTCGACCTAGCTGAGACTGAGAAGTCTTTCTCTATATCACTCTCCCAGTTCTTCTCAATAGATTTTTATAATAAGTGCCTTCTGCAAGATGGCATAGACTACTATAACAAGATCATCGGCGGCGAAACTCTCAAAAACGGTGAAAAGCTCATTGGCCTGAATGAGCTCATCAACCAATATAGACAAAATAACAAGGATCAGAAAATCCCATTCTTTAAGCTGCTAGATAAACAGATCCTATCAGAAAAAATCCTGTTCCTCGACGAAATCAAAAACGACACCGAACTCATCGAGGCTCTCTCGCAGTTTGCCAAGACGGCTGAGGAGAAGACGAAGATTGTGAAAAAGCTGTTTGCAGACTTTGTGGAGAACAACTCTAAATACGATTTGGCTCAGATTTATATCTCCCAGGAAGCATTTAACACAATCTCCAATAAGTGGACTAGCGAGACTGAAACCTTCGCCAAATACCTGTTCGAGGCCATGAAAAGCGGCAAGCTCGCCAAATACGAGAAGAAGGACAATTCCTATAAGTTTCCCGATTTCATCGCATTATCTCAGATGAAGTCCGCGCTACTTAGCATTAGCCTGGAAGGCCATTTTTGGAAGGAGAAATACTATAAGATTTCCAAATTCCAAGAAAAGACCAATTGGGAGCAGTTCTTGGCTATTTTTCTATACGAGTTCAACTCTTTGTTCAGTGACAAGATCAACACTAAGGACGGTGAGACCAAACAAGTGGGGTACTACCTCTTCGCCAAAGATCTTCATAACCTGATACTGTCCGAACAGATCGACATACCCAAGGATTCAAAGGTGACCATCAAGGATTTTGCGGATTCGGTATTGACGATCTATCAGATGGCGAAGTATTTCGCTGTCGAGAAAAAGCGGGCATGGCTGGCCGAATACGAGTTGGACTCCTTCTATACTCAACCCGATACAGGGTACCTGCAGTTTTACGATAATGCATACGAGGATATAGTCCAGGTGTACAATAAACTCAGGAACTACCTCACTAAGAAACCATACTCCGAAGAAAAATGGAAACTTAATTTTGAGAATAGTACACTGGCCAATGGATGGGACAAGAACAAGGAATCAGACAACTCCGCTGTAATTCTCCAGAAGGGTGGCAAGTATTATCTGGGACTGATAACAAAGGGCCATAACAAGATTTTCGATGACCGTTTTCAGGAGAAGTTTATAGTGGGCATAGAGGGTGGCAAGTATGAAAAAATAGTCTACAAGTTCTTTCCCGATCAGGCGAAGATGTTCCCCAAAGTATGCTTCAGTGCTAAAGGCCTCGAGTTTTTCCGGCCATCTGAAGAGATACTCCGCATCTATAATAACGCAGAGTTTAAAAA
GGGAGAGACGTACTCAATCGACTCGATGCAGAAACTCATTGACTTCTACAAAGATTGTCTCACAAAATACGAGGGCTGGGCTTGCTACACGTTTCGGCACTTGAAGCCAACCGAGGAATATCAAAACAACATCGGGGAGTTCTTCCGTGACGTCGCCGAAGACGGCTATAGAATTGACTTTCAGGGCATAAGTGATCAGTATATTCACGAGAAGAATGAGAAAGGTGAGTTGCATCTTTTCGAAATCCACAATAAAGACTGGAATCTTGACAAGGCTCGCGATGGAAAATCAAAGACTACCCAGAAGAATCTTCATACACTTTACTTCGAGTCCCTCTTTTCCAACGACAACGTCGTACAGAATTTCCCAATAAAACTGAACGGCCAGGCCGAAATTTTTTACAGGCCCAAAACCGAAAAAGATAAACTGGAATCCAAGAAAGACAAGAAGGGAAATAAGGTGATAGATCACAAAAGGTATTCCGAGAACAAGATTTTTTTCCACGTACCTCTTACCCTGAACAGAACGAAGAACGACTCTTATAGATTCAATGCCCAGATAAACAACTTTCTCGCAAACAACAAAGATATCAATATTATCGGCGTCGATAGAGGTGAGAAGCACTTGGTATATTATTCTGTGATCACGCAAGCATCCGATATCTTGGAGTCCGGTTCTTTGAACGAACTGAATGGTGTCAACTACGCCGAGAAACTCGGTAAGAAAGCTGAGAATCGGGAGCAGGCTAGAAGGGACTGGCAGGACGTTCAGGGTATCAAGGACCTGAAGAAGGGCTACATTTCTCAGGTGGTTCGAAAACTGGCTGATTTGGCCATTAAGCACAATGCAATCATCATTTTAGAAGATTTGAACATGCGGTTTAAACAAGTCAGGGGGGGGATAGAGAAATCAATTTACCAACAGCTGGAAAAAGCTCTGATTGATAAACTCTCTTTTTTGGTTGATAAGGGCGAAAAGAACCCCGAGCAAGCAGGACATCTCCTTAAAGCCTATCAACTGAGCGCACCTTTCGAGACATTCCAGAAGATGGGAAAGCAAACCGGCATCATTTTCTATACCCAGGCTTCCTATACATCCAAGTCTGATCCAGTGACTGGGTGGAGACCCCATCTCTACCTCAAGTACTTTTCTGCCAAAAAAGCTAAGGACGACATTGCTAAGTTCACAAAAATCGAGTTCGTGAACGACAGGTTCGAGCTGACTTATGACATAAAAGATTTCCAGCAGGCCAAGGAGTACCCAAACAAGACAGTTTGGAAAGTGTGTTCCAATGTGGAGAGGTTTCGGTGGGACAAGAATCTGAATCAGAATAAAGGGGGATATACTCACTACACCAACATTACCGAGAACATCCAAGAGTTGTTCACCAAATACGGCATCGACATTACTAAAGATCTGCTGACACAGATCTCCACCATCGATGAGAAGCAGAACACATCTTTCTTCCGGGATTTCATCTTTTATTTTAACTTGATCTGTCAGATTAGAAATACCGACGACAGTGAGATAGCTAAAAAAAACGGGAAAGACGATTTCATTCTCTCTCCCGTGGAGCCGTTTTTTGACTCCCGCAAAGACAATGGCAATAAGCTTCCGGAAAACGGGGACGATAACGGCGCCTACAACATCGCTCGTAAGGGAATCGTTATCCTCAATAAAATAAGCCAGTATTCCGAGAAGAACGAGAATTGTGAAAAAATGAAGTGGGGGGACCTTTACGTCAGCAACATCGATTGGGATAACTTTGTGACACAAGCCAATGCGAGACACTAG SEQATGGAAAACTTCAAAAACCTCTACCCCATCAACAAGACCTTGAGGTTTGAGCTCCGGCCATATGGGAAG 
IDACACTGGAGAACTTCAAAAAGTCCGGTCTGCTGGAAAAGGATGCTTTTAAGGCTAACTCTAGGAGGTCTNO:ATGCAGGCCATTATCGATGAGAAATTCAAGGAGACCATAGAGGAGCGTCTGAAATATACTGAGTTTTCC157 GAGTGTGACCTAGGAAATATGACCAGTAAGGACAAAAAGATCACCGACAAGGCAGCGACAAACCTGAAGAAACAGGTGATTTTAAGCTTTGATGATGAGATTTTCAATAACTACTTGAAGCCGGACAAAAACATCGACGCTCTGTTCAAGAATGATCCAAGCAACCCGGTCATCTCTACTTTCAAGGGCTTCACCACATACTTTGTAAATTTCTTCGAAATACGGAAACACATCTTCAAGGGAGAGTCTTCCGGTAGCATGGCTTACAGAATAATCGATGAGAACCTAACTACATATCTAAACAATATCGAGAAGATCAAGAAATTGCCTGAAGAACTGAAATCTCAGCTTGAGGGAATCGATCAAATTGACAAACTGAACAACTATAACGAGTTCATCACCCAGTCCGGCATTACTCATTATAACGAAATTATTGGAGGGATTTCGAAGTCTGAAAATGTCAAAATTCAAGGCATTAACGAAGGGATTAATCTTTACTGTCAAAAGAATAAAGTGAAGCTACCACGCTTAACTCCTCTGTATAAGATGATTCTCTCTGATCGGGTCTCTAATTCCTTTGTGCTGGATACCATTGAAAATGATACCGAGTTAATTGAAATGATCTCTGATCTGATAAATAAGACAGAGATAAGTCAGGATGTTATTATGTCCGACATCCAAAATATTTTCATCAAATATAAACAACTCGGCAACTTGCCGGGGATTAGCTACTCATCTATAGTGAATGCTATCTGTTCGGATTACGACAATAACTTTGGTGACGGCAAACGTAAAAAAAGCTATGAGAATGATCGCAAAAAACACCTCGAGACTAACGTGTATAGCATTAACTATATCTCAGAGTTACTGACAGACACCGACGTCTCCAGCAACATAAAGATGCGGTACAAAGAGCTGGAGCAGAATTATCAGGTATGCAAGGAAAATTTCAACGCCACTAACTGGATGAACATCAAAAACATTAAGCAGTCTGAGAAAACCAATCTGATCAAGGACCTTCTTGACATCCTCAAGAGCATCCAGCGGTTTTATGATTTGTTTGACATCGTGGATGAAGACAAAAATCCTAGTGCTGAGTTCTATACCTGGCTGTCTAAAAACGCGGAGAAACTGGACTTCGAGTTTAATTCAGTGTACAACAAGAGCAGGAACTACCTCACGAGAAAGCAGTACTCCGATAAAAAGATTAAGTTGAACTTCGATAGTCCTACTCTCGCCAAGGGGTGGGATGCGAACAAAGAAATTGATAATAGCACAATTATCATGAGGAAGTTCAACAACGACCGGGGCGATTACGATTACTTCTTGGGGATCTGGAATAAGAGCACACCTGCCAACGAAAAGATCATCCCATTAGAGGATAATGGACTGTTTGAAAAAATGCAATATAAGCTGTATCCCGATCCTAGTAAAATGCTGCCAAAGCAATTCCTTTCTAAGATCTGGAAAGCTAAACATCCAACTACACCCGAGTTTGATAAGAAGTACAAAGAAGGTCGGCACAAGAAGGGGCCTGATTTTGAGAAAGAGTTTCTGCACGAGTTGATCGATTGCTTTAAGCATGGATTGGTAAACCACGACGAAAAATATCAGGATGTGTTCGGGTTCAATCTGCGCAACACGGAAGACTACAACTCTTATACAGAGTTTCTGGAGGACGTCGAAAGGTGCAACTATAATCTTAGTTTCAATAAAATCGCTGACACGTCTAACTTGATAAATGATGGGAAACTCTATGTTTTTCAGATCTGGAGCAAGGATTTCAGCATAGATAGCAAGGGAACAAAAAACTTGAACACAATATACTTTGAATCCCTCTTCTCGGAGGAAAA
TATGATCGAGAAGATGTTCAAGCTCTCAGGGGAAGCCGAAATATTCTATCGTCCAGCAAGTTTGAATTATTGTGAAGATATTATCAAGAAGGGACACCACCACGCCGAACTGAAGGACAAATTCGACTATCCCATCATCAAGGACAAGCGATATAGCCAGGACAAATTTTTTTTTCATGTCCCCATGGTTATCAACTACAAAAGCGAGAAGTTAAACTCCAAATCACTTAACAATAGGACGAACGAAAATTTAGGCCAATTCACGCACATCATCGGTATCGACCGCGGAGAGCGACATCTCATCTACCTGACCGTGGTGGATGTGTCCACCGGTGAGATCGTTGAGCAAAAGCACCTGGATGAAATTATAAATACAGATACAAAAGGCGTCGAGCATAAAACTCATTATCTCAATAAATTAGAAGAGAAGTCCAAGACGCGGGATAATGAAAGAAAGTCCTGGGAAGCAATCGAGACGATTAAGGAGCTGAAAGAAGGCTATATTAGCCACGTGATCAATGAAATCCAGAAATTGCAGGAAAAGTATAACGCACTGATAGTGATGGAGAACCTCAATTATGGGTTTAAGAACTCGCGTATCAAAGTGGAAAAGCAGGTCTACCAGAAATTCGAGACCGCCCTGATTAAAAAGTTTAATTACATCATTGACAAGAAAGATCCTGAAACCTACATTCATGGATACCAACTGACGAATCCAATCACTACACTCGATAAAATTGGTAACCAGAGCGGTATTGTGTTGTACATTCCGGCTTGGAATACAAGCAAGATTGATCCAGTCACTGGTTTCGTTAACCTCCTGTATGCAGACGATTTGAAATACAAGAACCAGGAGCAGGCTAAAAGCTTTATCCAGAAAATCGATAATATCTACTTCGAAAATGGTGAGTTTAAATTTGATATAGATTTCAGCAAATGGAACAACCGCTACTCAATTAGCAAGACGAAATGGACACTGACAAGCTACGGAACCCGGATACAGACGTTCCGAAACCCCCAGAAAAATAACAAGTGGGACAGCGCCGAGTATGACCTGACCGAAGAGTTTAAATTAATCCTGAACATCGATGGTACTCTGAAATCTCAGGATGTGGAAACCTATAAGAAATTCATGTCTTTATTCAAGCTGATGTTGCAGCTGCGAAACTCCGTTACTGGAACAGACATTGACTACATGATTAGCCCTGTGACAGATAAAACTGGAACCCACTTTGATTCACGGGAGAATATCAAGAACCTGCCCGCCGATGCTGATGCGAACGGAGCTTACAACATTGCTAGGAAGGGCATCATGGCAATCGAGAATATTATGAACGGCATTAGCGACCCTCTGAAGATCAGTAATGAGGACTACCTGAAGTACATTCAGAACCAACAAGAGTAA SEQATGACCCAGTTTGAGGGTTTCACCAATCTTTATCAGGTGTCAAAAACACTCAGATTTGAGCTCATCCCACID AGGGTAAAACTTTAAAGCATATTCAAGAGCAGGGCTTTATAGAGGAAGACAAAGCCAGAAACGACCATNO:TATAAGGAACTAAAACCGATCATTGACCGCATCTACAAAACCTATGCCGACCAATGCCTTCAGCTCGTC158 
CAACTCGATTGGGAGAATCTGAGCGCCGCTATTGACAGCTACAGGAAGGAGAAGACCGAGGAGACTAGAAACGCCCTGATCGAGGAGCAGGCGACCTATAGAAACGCTATTCACGATTATTTTATCGGCCGCACCGACAATTTGACAGATGCCATCAACAAGCGGCACGCCGAAATTTATAAGGGGTTATTTAAGGCCGAGCTGTTCAATGGAAAAGTACTGAAACAGCTGGGCACCGTAACAACCACCGAACACGAGAATGCTCTGTTGAGGTCCTTCGACAAGTTTACTACCTACTTTAGCGGCTTCTACGAAAACCGTAAAAACGTGTTTTCCGCGGAGGATATTTCAACAGCCATTCCTCATAGGATCGTGCAGGATAATTTCCCCAAGTTTAAGGAGAACTGCCATATCTTTACCAGACTTATCACTGCTGTGCCAAGTTTACGAGAACACTTCGAGAATGTTAAGAAGGCTATAGGCATATTCGTTTCCACCTCCATCGAAGAAGTATTCAGTTTTCCATTCTACAATCAGTTACTCACGCAGACCCAGATAGATCTCTACAATCAGCTGCTCGGAGGCATTTCTAGAGAAGCAGGCACGGAAAAGATCAAGGGCTTAAATGAAGTACTCAATCTTGCAATTCAGAAGAACGATGAGACAGCACACATTATTGCATCTCTCCCTCACAGATTCATTCCCCTGTTCAAACAGATCCTGTCCGATCGCAACACACTAAGCTTTATACTTGAGGAGTTTAAGTCAGATGAGGAAGTGATCCAGAGCTTCTGTAAGTATAAGACTTTGCTCCGTAATGAAAACGTGCTTGAGACAGCAGAGGCTCTCTTTAACGAGTTGAATTCCATCGACCTGACACACATTTTTATCAGCCATAAAAAGCTGGAAACGATTAGCTCTGCCTTGTGCGACCACTGGGACACCCTGCGTAACGCCCTCTATGAAAGGCGCATTTCCGAGCTCACCGGGAAGATCACAAAAAGTGCCAAGGAAAAAGTCCAGAGGTCCCTTAAACATGAAGACATCAACCTACAAGAGATCATCTCTGCGGCTGGGAAAGAGCTGTCAGAAGCATTTAAACAGAAGACTTCCGAGATCCTGAGCCACGCACACGCCGCATTAGACCAGCCCCTGCCTACAACTCTTAAAAAACAGGAGGAGAAGGAGATTTTAAAGAGCCAGCTGGACTCATTACTCGGCCTGTATCATCTCCTGGACTGGTTCGCCGTGGACGAATCCAACGAGGTGGACCCAGAATTTAGCGCCAGGCTGACAGGAATTAAACTGGAAATGGAGCCAAGTTTGAGCTTTTACAACAAGGCTCGGAACTATGCCACTAAAAAGCCCTACAGCGTGGAAAAGTTCAAGCTGAATTTTCAGATGCCGACCCTGGCTTCCGGGTGGGATGTTAATAAGGAAAAGAATAATGGGGCTATACTGTTCGTCAAAAATGGTCTCTACTACCTGGGAATCATGCCCAAACAGAAGGGCAGGTACAAAGCCCTTTCGTTTGAGCCGACCGAAAAAACCAGCGAAGGCTTTGATAAGATGTATTACGACTATTTCCCAGATGCAGCCAAGATGATCCCAAAATGTAGCACTCAGTTGAAGGCGGTAACCGCTCACTTTCAGACACACACCACTCCTATCTTGCTCTCCAACAACTTTATTGAGCCGCTGGAGATCACGAAGGAAATCTACGACCTTAACAACCCAGAGAAGGAACCCAAGAAATTCCAAACAGCTTATGCTAAGAAGACTGGGGATCAAAAGGGCTATCGAGAGGCTTTGTGTAAGTGGATTGACTTTACACGGGATTTCCTGAGTAAGTATACCAAGACCACATCTATTGACCTGTCCTCACTGAGACCTTCCTCACAATATAAGGATCTCGGAGAGTATTATGCCGAACTCAACCCTCTACTCTATCACATCTCTTTCCAGAGGATCGCCGAAAAGGAAATTATGGACGCCGTCGA
GACAGGCAAGCTGTACCTCTTCCAGATTTACAACAAGGATTTCGCAAAGGGCCACCACGGAAAACCCAATTTGCACACTTTGTACTGGACAGGGCTCTTCTCTCCCGAAAATTTGGCCAAAACTTCAATAAAACTGAACGGGCAAGCCGAGCTGTTCTATCGGCCCAAGTCACGTATGAAGCGGATGGCCCACCGGCTGGGCGAGAAGATGCTCAACAAGAAACTGAAGGATCAGAAGACGCCCATACCAGACACTCTTTACCAAGAGCTGTATGACTACGTGAATCACAGACTGAGTCACGACCTGTCTGATGAAGCCCGGGCTCTTCTTCCAAATGTGATTACCAAAGAAGTTTCCCACGAAATTATCAAGGACCGGCGCTTCACCTCTGACAAATTCTTTTTCCACGTCCCAATCACCCTCAACTACCAGGCAGCCAATTCCCCTTCAAAGTTTAACCAGCGTGTGAATGCCTACCTGAAAGAGCATCCGGAGACCCCCATCATAGGGATAGACAGAGGAGAGCGGAATCTTATCTACATTACTGTGATTGACAGCACAGGTAAGATCTTGGAGCAGAGATCTTTAAATACAATCCAGCAGTTTGACTACCAGAAGAAACTGGATAACCGAGAGAAGGAAAGGGTTGCTGCAAGACAGGCCTGGTCAGTGGTCGGCACCATCAAAGACCTGAAGCAGGGCTACTTATCCCAAGTAATTCACGAAATTGTCGATCTTATGATTCATTATCAAGCCGTTGTTGTGCTGGAGAACCTGAATTTTGGCTTCAAAAGCAAACGAACAGGTATCGCCGAGAAAGCCGTGTATCAGCAGTTCGAAAAGATGCTCATAGACAAGCTGAACTGCTTAGTGCTGAAGGATTATCCTGCTGAGAAGGTCGGCGGCGTACTTAACCCATACCAGCTGACCGATCAGTTCACTAGTTTCGCCAAGATGGGAACGCAAAGTGGCTTCCTTTTCTACGTGCCCGCTCCCTACACGAGTAAGATCGACCCTCTGACCGGCTTCGTCGACCCATTCGTCTGGAAGACCATCAAGAATCACGAATCACGGAAACACTTCTTAGAGGGGTTTGACTTCCTGCACTACGACGTGAAGACAGGGGACTTCATCTTACACTTTAAGATGAATCGAAACCTCTCCTTCCAGCGGGGCCTGCCTGGTTTCATGCCCGCATGGGACATCGTGTTTGAGAAAAACGAGACACAGTTTGACGCTAAGGGAACCCCCTTTATTGCGGGGAAGCGGATTGTCCCAGTCATCGAAAACCATCGGTTCACCGGGCGATACCGGGATCTGTACCCGGCCAACGAGCTCATCGCGCTGCTGGAGGAGAAGGGTATTGTGTTTAGGGATGGATCCAACATTCTGCCTAAGTTGCTGGAAAATGATGATTCGCACGCCATTGATACCATGGTTGCACTGATTAGATCCGTACTGCAGATGAGGAATAGCAATGCTGCAACCGGGGAGGATTATATTAATTCCCCAGTGCGAGATCTGAATGGTGTCTGTTTTGACTCGCGCTTTCAGAATCCAGAATGGCCAATGGATGCAGACGCTAACGGGGCGTACCACATTGCTCTGAAAGGCCAGCTACTCCTGAACCACCTCAAGGAGAGCAAAGATCTGAAGCTGCAGAACGGCATTTCCAACCAAGACTGGCTCGCCTACATACAAGAACTGCGCAATTAA SEQATGGCTGTCAAATCCATCAAGGTTAAATTACGGCTTGATGACATGCCCGAGATCCGCGCCGGGCTCTGG IDAAACTCCATAAAGAAGTGAATGCTGGCGTTAGATACTACACAGAATGGCTCTCCCTGCTGCGCCAGGAANO: AATTTGTACCGCCGGTCACCTAATGGAGATGGAGAGCAGGAATGCGATAAAACAGCAGAAGAGTGCAA159 
AGCCGAATTGCTGGAGCGACTGCGGGCACGGCAGGTTGAGAATGGACACCGAGGTCCGGCGGGATCGGACGACGAGCTGCTCCAGCTCGCCAGACAATTATATGAACTGCTGGTGCCTCAGGCTATTGGGGCAAAGGGTGACGCACAGCAGATTGCTAGAAAATTTCTGTCTCCCCTCGCCGACAAAGACGCTGTCGGCGGCCTTGGGATAGCCAAAGCCGGCAACAAACCCCGATGGGTGCGCATGAGGGAGGCTGGTGAGCCTGGCTGGGAGGAAGAAAAGGAAAAGGCCGAAACCAGAAAGTCCGCCGACAGGACCGCGGACGTACTCCGAGCATTGGCCGATTTTGGGCTGAAGCCCTTAATGCGAGTCTACACCGATAGTGAAATGTCTAGCGTGGAGTGGAAGCCATTACGCAAAGGGCAGGCAGTGCGGACGTGGGACCGTGACATGTTCCAGCAAGCCATCGAGCGAATGATGAGCTGGGAGAGCTGGAACCAGAGAGTGGGGCAGGAGTATGCCAAGCTGGTCGAGCAGAAAAACCGGTTTGAGCAAAAAAATTTTGTAGGTCAGGAACACCTGGTGCATCTCGTTAACCAGCTCCAGCAAGATATGAAGGAAGCTTCGCCTGGATTAGAGAGCAAAGAGCAGACTGCACACTATGTAACCGGAAGAGCACTGAGGGGCAGTGACAAAGTGTTCGAAAAATGGGGAAAACTGGCTCCCGATGCCCCCTTTGACCTGTACGACGCAGAAATAAAAAACGTGCAGCGGCGAAACACCAGGCGATTTGGTAGCCATGATCTGTTCGCCAAATTGGCAGAGCCGGAATATCAGGCTCTTTGGCGAGAAGACGCATCATTTCTCACTAGGTACGCGGTCTATAACTCCATTTTGAGGAAATTGAACCACGCAAAAATGTTTGCCACCTTCACGTTGCCTGACGCCACCGCTCATCCCATTTGGACACGGTTTGATAAGCTGGGCGGCAATCTGCATCAGTATACATTCCTGTTTAACGAGTTTGGAGAGCGAAGACATGCGATACGATTCCACAAGCTACTGAAGGTCGAAAATGGCGTGGCACGTGAGGTGGACGATGTCACCGTGCCCATCAGCATGAGCGAACAGCTGGATAATTTGTTGCCGCGGGACCCAAATGAACCTATAGCCCTTTATTTTAGGGACTACGGGGCGGAGCAACATTTCACTGGGGAGTTTGGCGGCGCAAAAATTCAGTGCCGACGCGACCAGCTCGCCCACATGCATAGAAGACGCGGGGCCCGGGACGTATACCTTAACGTCTCTGTGAGGGTGCAGTCCCAGTCAGAGGCAAGAGGGGAACGCAGACCACCTTACGCAGCAGTATTCAGGCTGGTAGGCGATAACCACCGGGCGTTTGTACACTTTGATAAACTTTCTGACTACCTGGCCGAACACCCGGATGACGGCAAATTAGGATCGGAGGGGCTGCTTAGCGGCCTGCGTGTGATGAGCGTCGATCTGGGGCTACGGACCTCTGCTTCCATCTCTGTGTTCCGTGTGGCCCGAAAGGACGAGTTGAAACCTAATTCGAAGGGCCGTGTACCATTCTTTTTCCCTATTAAGGGAAATGATAATCTCGTCGCGGTGCACGAGCGTTCCCAACTGCTGAAACTGCCTGGCGAGACCGAGTCCAAAGATCTCAGAGCAATCCGGGAGGAGCGACAACGTACACTTAGGCAACTCCGCACCCAGCTGGCCTATCTGCGCTTGCTGGTGCGGTGCGGCTCCGAGGATGTAGGGAGAAGAGAGCGAAGCTGGGCAAAGCTGATAGAGCAACCAGTTGACGCCGCGAATCACATGACCCCCGACTGGCGCGAAGCGTTTGAAAATGAGCTGCAGAAGTTGAAATCTCTGCATGGGATTTGCTCAGATAAGGAGTGGATGGACGCCGTATACGAGTCTGTTCGCCGGGTATGGCGGCACATGGGGAAGCAGGTGAGAGATTGGAGAAAGG
ACGTTCGCTCTGGGGAACGGCCGAAAATTCGGGGATACGCAAAGGATGTCGTGGGCGGCAATAGCATTGAGCAGATCGAGTACCTGGAAAGGCAATACAAATTTCTGAAATCTTGGTCTTTCTTTGGGAAGGTAAGCGGACAAGTTATCAGAGCCGAAAAGGGATCTCGCTTTGCTATCACATTGAGGGAACACATTGATCACGCCAAAGAAGACAGGTTGAAAAAGTTGGCTGATCGCATTATCATGGAAGCACTCGGTTACGTCTACGCCCTTGATGAGCGCGGTAAAGGGAAGTGGGTAGCCAAGTATCCCCCATGTCAGCTGATCCTGCTCGAGGAACTTTCTGAGTATCAGTTCAATAACGACCGTCCTCCCTCCGAAAATAATCAGCTCATGCAATGGTCCCACCGGGGTGTGTTCCAAGAACTGATCAATCAGGCTCAGGTGCACGACCTCCTCGTAGGCACTATGTATGCAGCCTTTAGCTCCCGTTTTGACGCGCGCACAGGCGCCCCTGGAATACGATGTAGGCGAGTTCCCGCACGGTGCACTCAAGAACATAACCCGGAGCCTTTCCCATGGTGGCTCAATAAGTTTGTTGTGGAGCATACCCTCGACGCTTGCCCATTGAGGGCGGATGACTTGATTCCCACAGGCGAGGGGGAGATCTTCGTGAGCCCATTTTCTGCCGAAGAAGGGGATTTCCACCAAATACATGCCGACTTGAATGCTGCCCAAAATCTGCAGCAAAGGCTGTGGTCAGACTTCGACATCTCGCAAATCAGACTGCGGTGTGACTGGGGCGAAGTAGACGGCGAGCTGGTGCTGATACCTAGACTGACGGGTAAGCGTACCGCCGATAGCTATAGTAATAAGGTTTTTTATACGAATACGGGGGTGACATATTACGAGCGTGAGAGAGGCAAGAAGCGTCGGAAGGTGTTCGCGCAGGAGAAGCTGAGCGAAGAGGAGGCGGAGCTACTGGTAGAGGCAGATGAGGCAAGAGAAAAGTCCGTCGTCCTGATGCGGGATCCTAGCGGGATTATTAACAGAGGTAATTGGACACGGCAGAAAGAATTCTGGAGCATGGTGAATCAAAGAATCGAGGGTTACCTGGTGAAGCAAATTCGAAGCCGGGTGCCCCTTCAAGACAGCGCATGTGAAAACACTGGGGACATCTAG SEQATGGCTACTCGGTCCTTCATCCTGAAAATCGAGCCAAATGAAGAGGTGAAAAAGGGCCTGTGGAAGACC IDCATGAGGTACTTAACCACGGCATAGCATACTATATGAATATCCTAAAACTTATACGGCAGGAGGCTATCNO: 
TACGAGCATCACGAGCAAGATCCTAAAAATCCAAAGAAGGTTAGTAAGGCTGAAATCCAGGCTGAATT160GTGGGACTTCGTGCTGAAGATGCAGAAATGCAACAGTTTCACGCATGAAGTTGATAAGGACGTCGTGTTTAATATACTCCGGGAGCTGTACGAAGAACTGGTACCAAGCTCTGTGGAAAAGAAAGGAGAGGCCAACCAGCTAAGTAATAAGTTCCTCTATCCTCTCGTGGACCCCAATTCACAGAGCGGCAAAGGTACCGCATCTTCTGGGAGGAAACCACGCTGGTACAACTTGAAGATCGCTGGCGATCCCAGCTGGGAGGAGGAAAAGAAGAAATGGGAAGAGGATAAAAAGAAAGACCCCCTGGCCAAAATCTTAGGCAAGCTCGCCGAGTACGGTCTGATTCCACTTTTCATCCCGTTCACAGATAGCAATGAGCCGATCGTCAAGGAGATTAAGTGGATGGAAAAGAGCCGCAATCAGAGTGTGCGGAGGCTGGACAAAGACATGTTTATTCAGGCCCTGGAACGCTTCCTTAGCTGGGAAAGCTGGAACCTGAAGGTTAAGGAAGAGTACGAAAAAGTCGAGAAGGAGCATAAGACTTTGGAGGAGCGCATCAAAGAAGACATCCAGGCCTTTAAGTCTCTAGAACAGTATGAGAAAGAACGGCAGGAACAGCTGCTGCGTGATACACTGAACACAAACGAATATCGCCTGAGCAAGAGGGGACTCAGAGGCTGGAGAGAAATCATTCAAAAGTGGCTCAAAATGGATGAAAATGAGCCGTCTGAAAAATACCTTGAAGTTTTCAAGGACTACCAGCGGAAGCACCCTAGAGAAGCCGGCGACTATAGTGTTTACGAATTCTTGAGCAAGAAGGAGAATCATTTTATATGGAGGAATCACCCGGAGTACCCATATCTGTACGCAACCTTCTGCGAAATCGACAAGAAAAAAAAAGACGCCAAGCAACAGGCTACATTTACTCTGGCCGACCCTATCAATCACCCTCTATGGGTCCGGTTTGAGGAGCGCTCCGGAAGCAATCTGAATAAATATCGTATTCTGACTGAACAGTTACACACAGAGAAGCTCAAGAAGAAACTTACGGTGCAGCTGGACCGCCTGATATACCCAACAGAGTCCGGAGGATGGGAAGAGAAAGGAAAGGTTGACATCGTACTGCTTCCATCTCGTCAGTTTTACAACCAGATATTCCTGGACATCGAGGAGAAGGGGAAACACGCCTTCACATACAAGGACGAGTCCATAAAGTTCCCACTGAAGGGTACTTTAGGCGGTGCTAGGGTGCAGTTCGACCGCGATCACCTGAGACGGTACCCCCACAAGGTGGAGAGCGGGAACGTGGGACGAATCTACTTTAATATGACAGTGAACATTGAACCCACAGAGAGTCCAGTTAGTAAATCCCTGAAAATTCACCGTGACGACTTTCCGAAATTTGTGAATTTCAAGCCAAAGGAGCTTACGGAGTGGATCAAGGATTCAAAGGGAAAGAAGCTGAAATCTGGTATCGAATCTCTCGAGATCGGTCTCCGTGTCATGAGCATCGATCTGGGACAGCGCCAGGCAGCTGCCGCCAGTATATTCGAGGTGGTAGACCAAAAGCCTGACATCGAGGGAAAGCTCTTCTTCCCAATCAAAGGCACAGAGCTGTATGCGGTGCACCGGGCGTCCTTTAATATAAAGCTGCCCGGTGAAACCCTGGTGAAGTCACGGGAGGTGCTTAGAAAAGCGCGAGAGGATAACCTCAAACTGATGAACCAAAAACTGAACTTTCTGAGGAACGTCCTGCACTTTCAGCAGTTCGAAGATATTACCGAACGCGAAAAGAGAGTAACCAAGTGGATATCTCGTCAAGAGAACAGCGACGTCCCGTTAGTCTATCAGGACGAACTCATCCAAATACGGGAGTTGATGTATAAGCCCTACAAGGATTGGGTCGCCTTTCTTAAGCAGCTTCACAA
ACGCCTAGAGGTCGAAATAGGTAAAGAGGTGAAACATTGGCGGAAGTCGCTCAGCGACGGGAGGAAGGGACTTTATGGCATCTCTTTGAAGAACATTGACGAAATCGATAGAACCAGAAAATTTTTGTTGAGATGGTCCCTCCGACCCACCGAGCCTGGAGAGGTGAGGCGGTTAGAACCAGGACAGAGGTTCGCTATCGATCAGCTGAATCACCTCAATGCTCTGAAGGAGGACCGCCTCAAGAAAATGGCCAATACAATCATAATGCACGCCCTTGGCTACTGCTACGACGTCCGAAAGAAGAAGTGGCAGGCCAAGAATCCCGCCTGTCAAATTATCCTTTTTGAGGATCTTAGCAATTACAACCCCTATGAAGAGCGGTCCAGATTCGAAAATAGTAAGCTCATGAAGTGGAGCCGCAGGGAGATCCCGCGCCAAGTGGCCCTTCAGGGGGAAATTTATGGGCTGCAGGTAGGCGAGGTCGGGGCCCAATTCTCCTCGCGCTTTCATGCGAAAACTGGAAGTCCTGGAATCCGGTGCTCAGTGGTGACAAAGGAGAAGTTGCAAGACAATCGGTTTTTTAAAAACTTACAGCGGGAGGGAAGGCTGACCCTGGATAAGATAGCCGTACTTAAGGAAGGAGATCTGTACCCTGACAAAGGCGGTGAAAAGTTCATTAGCTTGAGCAAGGACCGAAAACTTGTGACCACCCACGCTGACATCAATGCGGCACAGAACCTGCAGAAGAGATTTTGGACTCGCACCCACGGATTCTACAAAGTTTACTGCAAAGCATATCAAGTAGACGGACAGACCGTATACATCCCCGAGTCCAAAGATCAGAAGCAGAAAATTATTGAAGAGTTTGGGGAAGGGTACTTTATCCTGAAGGATGGTGTCTACGAATGGGGCAACGCTGGTAAACTTAAAATTAAGAAGGGCAGCTCTAAACAGTCCTCCAGCGAGTTAGTTGATTCTGATATTCTGAAAGACAGTTTCGACCTGGCCAGCGAACTTAAAGGGGAAAAATTAATGCTGTACCGGGACCCCAGCGGAAACGTCTTTCCATCCGATAAGTGGATGGCCGCTGGAGTGTTCTTTGGCAAGTTAGAGAGGATTCTCATAAGTAAGCTGACCAACCAATACTCAATCTCCACAATCGAGGATGACTCATCCAAGCAGTCTATGTGA SEQATGCCTACACGCACTATCAACCTGAAACTGGTTCTTGGCAAGAATCCAGAGAATGCTACCCTTCGTCGG IDGCACTATTTTCAACGCATAGACTGGTGAATCAGGCTACCAAACGGATTGAAGAGTTCCTCTTGCTTTGTCNO: 
GGGGGGAAGCATATAGGACGGTGGATAATGAGGGGAAAGAGGCTGAAATTCCGAGACACGCCGTGCAG161GAGGAAGCTCTTGCGTTTGCAAAGGCCGCTCAACGGCACAATGGTTGCATCTCTACTTATGAAGACCAGGAAATCCTGGATGTGCTCCGGCAACTGTATGAAAGGCTGGTGCCTTCTGTGAATGAAAATAATGAAGCAGGGGACGCTCAAGCCGCAAACGCGTGGGTGTCGCCACTGATGTCCGCCGAGTCCGAGGGAGGGCTCAGCGTTTACGACAAGGTGCTGGACCCACCCCCAGTGTGGATGAAACTCAAAGAGGAAAAAGCTCCGGGCTGGGAGGCTGCTTCCCAGATCTGGATCCAGTCCGACGAAGGGCAGTCCCTTCTTAACAAGCCTGGTTCGCCCCCGCGGTGGATTAGGAAACTGAGGTCAGGCCAGCCTTGGCAGGACGATTTTGTTAGCGACCAGAAAAAGAAGCAGGACGAGCTGACAAAGGGGAATGCGCCACTGATCAAACAATTAAAGGAAATGGGCTTATTGCCTCTTGTGAATCCCTTTTTTAGACATCTGCTTGACCCGGAGGGGAAGGGGGTGTCACCTTGGGACAGACTCGCTGTTAGGGCCGCTGTCGCTCATTTCATATCATGGGAATCATGGAACCACCGGACACGCGCCGAATACAATAGTTTGAAGCTGCGGAGGGATGAGTTCGAAGCAGCTTCCGACGAATTCAAGGACGACTTCACGCTGCTTCGGCAGTACGAGGCTAAGAGGCACTCCACACTGAAGAGTATAGCTTTAGCCGATGATTCAAACCCTTATAGGATCGGCGTACGCTCCCTCCGCGCTTGGAACCGCGTCCGCGAGGAGTGGATCGACAAGGGAGCGACCGAGGAGCAGCGGGTCACCATTCTCAGCAAGTTGCAGACCCAACTAAGGGGCAAATTTGGAGATCCTGACTTGTTCAACTGGCTGGCGCAGGACCGGCACGTGCACCTCTGGAGCCCTAGAGATAGTGTTACCCCACTGGTTAGGATCAACGCTGTTGACAAAGTATTGCGACGGAGAAAACCGTACGCCTTGATGACTTTTGCCCACCCAAGATTCCACCCTCGGTGGATACTTTACGAAGCCCCAGGGGGCAGCAATCTCCGCCAGTATGCACTGGATTGTACCGAAAATGCTCTGCACATTACACTGCCTCTGCTGGTTGACGATGCACATGGCACATGGATTGAGAAAAAAATTAGGGTTCCTCTTGCCCCCAGCGGCCAGATTCAGGACCTGACACTAGAAAAGCTCGAGAAGAAGAAAAATCGTCTCTACTACCGTTCTGGGTTCCAGCAGTTTGCCGGCCTGGCCGGAGGTGCCGAGGTGCTTTTCCATCGACCATACATGGAGCACGATGAGAGGAGCGAGGAGAGCTTATTAGAACGCCCTGGTGCTGTTTGGTTCAAACTCACCTTGGACGTGGCAACCCAGGCCCCTCCAAACTGGTTGGACGGAAAGGGCCGCGTCCGAACGCCCCCCGAGGTTCACCACTTCAAGACAGCCCTCAGTAACAAGTCTAAGCACACACGGACCCTCCAGCCCGGACTCAGAGTGTTATCCGTGGATCTGGGAATGCGCACCTTCGCCTCTTGCTCCGTATTTGAGCTGATCGAGGGCAAACCAGAGACTGGCAGAGCGTTCCCTGTGGCCGACGAACGTTCCATGGATTCACCAAACAAGCTGTGGGCCAAGCACGAAAGATCCTTTAAACTCACGCTCCCCGGCGAAACCCCCAGTCGGAAAGAAGAGGAGGAACGGAGCATTGCAAGAGCCGAAATCTATGCGTTGAAAAGAGATATTCAGAGATTAAAAAGTCTTCTGCGCCTGGGGGAAGAGGATAACGATAATAGACGCGATGCACTTCTTGAGCAATTTTTCAAGGGCTGGGGCGAGGAAGACGTGGTTCCAGGTCAGGCCTTTCCCCGGAGTCTGTTCCAG
GGGCTGGGGGCCGCCCCATTCAGATCCACCCCTGAGTTGTGGAGACAACACTGTCAAACCTATTATGATAAAGCAGAGGCGTGCCTGGCTAAACACATCAGCGATTGGCGCAAGAGAACCAGGCCTAGGCCTACCTCACGTGAGATGTGGTACAAGACACGCTCTTATCACGGCGGAAAGTCAATCTGGATGCTGGAATACCTCGACGCTGTGAGGAAACTGCTCTTATCCTGGAGCCTCAGAGGCCGGACCTACGGGGCTATCAACAGACAGGACACAGCAAGGTTCGGGAGCTTAGCCAGCCGGCTCCTTCACCACATTAACTCACTCAAAGAGGATCGAATAAAGACCGGAGCCGACTCGATCGTGCAGGCAGCCCGAGGGTACATCCCCCTGCCTCATGGGAAGGGCTGGGAGCAGCGATATGAACCCTGCCAGCTGATCTTGTTTGAGGACCTTGCCCGTTATAGATTTCGCGTTGATAGACCTCGCCGTGAGAATTCTCAGCTGATGCAGTGGAACCACAGAGCGATCGTGGCTGAGACCACTATGCAGGCCGAGCTGTATGGACAGATCGTGGAGAACACCGCCGCAGGGTTCAGTTCTCGGTTTCATGCTGCCACCGGAGCTCCCGGCGTCCGGTGCCGCTTCCTCTTAGAGCGTGATTTTGACAATGACCTCCCAAAGCCCTATCTGCTGAGGGAACTGAGCTGGATGCTGGGGAACACAAAAGTAGAATCGGAGGAGGAGAAGCTACGGCTCCTCTCCGAAAAGATACGTCCAGGCTCTCTGGTACCATGGGACGGAGGAGAGCAGTTCGCGACACTGCATCCTAAGAGACAGACGTTATGTGTGATTCACGCCGATATGAACGCCGCTCAGAATCTGCAGCGAAGATTCTTTGGCCGCTGCGGCGAAGCCTTCAGGCTGGTATGTCAGCCCCACGGGGATGATGTGCTGCGGCTGGCCTCAACCCCTGGGGCTAGACTCTTGGGGGCACTCCAGCAGCTGGAAAATGGCCAAGGGGCTTTCGAACTCGTTCGGGACATGGGCAGCACAAGCCAGATGAACAGATTCGTCATGAAGAGCCTGGGAAAGAAAAAGATCAAACCCTTACAGGACAATAATGGCGACGACGAACTGGAGGACGTGTTGTCCGTGCTGCCAGAGGAAGACGACACAGGCCGCATCACTGTCTTCCGCGACTCAAGTGGGATATTCTTTCCTTGCAACGTGTGGATTCCGGCCAAACAGTTCTGGCCTGCCGTCAGAGCCATGATTTGGAAAGTGATGGCTAGTCATTCATTGGGATGA
SEQ ID NO: 162
ATGACAAAGCTGAGGCACAGACAAAAGAAGCTTACACACGACTGGGCAGGGAGCAAGAAACGTGAGGTCCTTGGGTCAAATGGAAAACTGCAGAACCCCTTGCTCATGCCTGTAAAGAAGGGGCAGGTAACAGAATTTAGAAAAGCATTCTCCGCGTACGCTCGGGCAACTAAGGGGGAAATGACCGATGGACGGAAGAACATGTTCACCCATTCTTTCGAGCCATTCAAAACAAAGCCGTCATTGCACCAATGCGAGCTGGCCGATAAGGCTTACCAGTCTTTGCATAGTTACCTCCCCGGTTCCCTGGCCCATTTCTTGCTTTCCGCACACGCACTGGGCTTTCGTATTTTCTCTAAATCTGGGGAGGCAACTGCCTTCCAGGCCAGCTCAAAAATCGAGGCCTATGAGTCCAAGCTCGCTTCGGAGCTAGCCTGTGTCGATTTGAGTATCCAGAATTTGACGATTAGTACTCTTTTCAACGCTCTCACAACTTCAGTTCGGGGCAAGGGGGAGGAAACTTCAGCAGATCCCCTTATCGCACGGTTCTACACTCTCCTGACGGGCAAGCCCCTGAGCCGAGACACACAGGGCCCAGAACGGGACTTGGCAGAGGTCATCTCCAGAAAGATCGCCTCGTCCTTCGGCACATGGAAGGAAATGACTGCCAACCCTCTGCAGAGCCTCCAGTTCTTCGAAGAAGAGCTTCATGCACTAGATGCCAACGTGTCTTTATCTCCAGCTTTTGATGTGTTAATCAAGATGAATGATCTCCAAGGTGATCTGAAGAACCGTACTATAGTGTTCGACCCAGATGCACCCGTGTTCGAGTACAACGCTGAGGATCCAGCCGATATCATCATAAAGCTGACAGCTCGGTATGCGAAGGAGGCCGTCATCAAGAATCAGAACGTGGGCAATTATGTGAAAAACGCCATTACCACCACTAATGCCAATGGGCTGGGGTGGCTCCTCAATAAAGGGCTTTCACTACTGCCAGTTTCTACTGACGATGAGCTGCTCGAATTCATTGGGGTGGAGAGAAGCCATCCCAGCTGTCACGCGCTGATAGAGCTGATTGCCCAGCTAGAGGCGCCGGAACTGTTTGAGAAGAATGTGTTTAGTGACACCCGTTCCGAGGTTCAGGGTATGATCGACAGTGCAGTGTCGAACCACATTGCTCGGCTGTCCAGCAGCCGAAACTCCCTGAGCATGGACAGCGAGGAATTGGAACGCTTGATTAAATCTTTCCAGATTCATACTCCCCATTGTTCTCTGTTCATAGGCGCTCAGTCCTTATCTCAGCAGCTGGAGAGCTTACCTGAGGCGCTGCAGTCCGGAGTGAACAGCGCTGATATCTTATTAGGCAGCACACAGTATATGCTGACCAACTCTCTCGTTGAAGAGTCAATTGCAACATATCAAAGGACATTAAATAGGATCAATTACCTGAGTGGGGTGGCTGGGCAGATTAACGGTGCTATCAAAAGAAAGGCAATCGACGGCGAAAAAATACACCTGCCTGCCGCCTGGAGTGAGCTCATCTCCTTACCTTTCATTGGACAGCCGGTGATTGATGTGGAGAGCGACCTGGCACACTTAAAAAACCAGTACCAGACCCTGTCCAATGAATTTGACACCCTCATTTCGGCCCTGCAGAAGAACTTCGATTTGAATTTCAACAAAGCACTCCTTAACCGCACGCAGCATTTCGAGGCAATGTGCCGGAGCACAAAAAAAAATGCTTTATCTAAGCCCGAGATCGTGTCCTACAGAGATCTGCTGGCGCGGCTGACCAGTTGCCTTTATCGAGGCTCGCTGGTTCTCAGAAGGGCGGGAATCGAAGTTCTGAAAAAGCACAAAATCTTTGAGTCGAATAGTGAGCTGAGAGAACACGTCCACGAGCGAAAGCACTTCGTGTTCGTTAGTCCATTGGACAGAAAGGCAAAAAAACTGTTGCGCCTGACCGATTCCCGCCCTGACTTGCTCCATGTGATCGATGAGATCCTGCAACATGACAATCTGGAGAATAAGGACAGAGAGTCCCTTTGGCTGGTCCGGTCTGGGTACCTCCTTGCTGGTCTGCCGGACCAGCTGAGTTCTTCGTTTATCAATCTCCCCATAATCACGCAAA
AGGGCGATCGCCGGCTGATTGACCTGATTCAGTATGACCAGATCAATCGCGATGCTTTCGTAATGTTGGTGACAAGTGCTTTCAAAAGCAATCTCTCTGGGTTGCAGTACCGCGCTAACAAGCAGTCTTTCGTGGTCACCCGCACCCTGTCTCCTTACCTGGGTAGTAAGCTCGTATACGTCCCTAAAGACAAAGATTGGCTGGTCCCATCCCAGATGTTTGAGGGAAGATTCGCCGATATTCTGCAGAGTGACTACATGGTCTGGAAGGATGCCGGACGCCTGTGCGTGATCGACACTGCCAAACATCTCTCTAACATTAAAAAAAGCGTGTTTAGTAGCGAAGAAGTCCTTGCTTTTCTTCGAGAGCTGCCTCACCGGACCTTCATCCAGACCGAGGTACGGGGGTTAGGAGTGAACGTCGATGGAATCGCATTTAATAACGGGGATATCCCGAGCTTGAAGACATTCTCGAATTGTGTGCAGGTGAAGGTGAGTAGGACTAATACTAGTCTCGTGCAGACTCTAAACAGGTGGTTCGAGGGTGGCAAAGTGTCACCTCCCTCTATTCAGTTCGAAAGAGCTTACTACAAAAAAGACGATCAGATTCACGAGGACGCAGCCAAGAGAAAGATACGCTTCCAGATGCCAGCAACGGAATTAGTGCACGCCAGCGATGACGCTGGTTGGACCCCCAGCTACCTGCTGGGCATCGACCCCGGTGAGTACGGAATGGGTCTCAGTTTGGTGTCCATCAACAATGGAGAGGTCCTGGATTCTGGATTCATCCACATTAATTCCCTGATCAATTTCGCGTCCAAAAAAAGCAATCACCAGACCAAAGTAGTCCCCCGCCAGCAGTACAAGTCCCCCTACGCGAATTATCTCGAGCAGTCAAAGGATTCAGCAGCAGGGGATATAGCTCACATTCTGGATCGGCTAATCTACAAATTGAACGCCTTGCCTGTGTTCGAGGCGCTGTCTGGCAACAGTCAGAGTGCTGCTGATCAGGTATGGACCAAAGTTCTATCCTTCTATACATGGGGAGACAACGACGCACAGAACAGTATACGGAAGCAGCACTGGTTCGGTGCCTCACACTGGGATATTAAGGGGATGCTGCGCCAACCCCCAACCGAAAAAAAACCCAAACCATATATAGCCTTTCCCGGGAGTCAAGTGTCATCCTATGGAAATAGTCAAAGGTGTAGTTGTTGCGGCCGCAATCCCATTGAGCAGTTGCGTGAGATGGCAAAGGACACGAGTATCAAGGAGCTGAAAATCCGAAATAGTGAGATCCAACTATTCGATGGTACAATCAAGCTGTTTAACCCCGACCCTTCCACCGTCATCGAGAGGCGGCGGCATAACCTAGGACCCTCACGCATTCCTGTGGCAGACCGAACTTTCAAGAATATTAGCCCTTCTTCGTTAGAGTTCAAGGAGCTCATTACTATCGTTTCTCGAAGCATCCGCCATAGCCCCGAATTTATTGCTAAGAAACGGGGTATCGGGTCTGAGTACTTTTGTGCTTATTCTGACTGCAACTCCTCACTGAACTCAGAGGCCAATGCCGCGGCCAATGTGGCACAGAAGTTTCAGAAGCAACTCTTTTTCGAACTCTGA
SEQ ID NO: 163
ATGAAACGTATTCTGAACTCTCTGAAAGTCGCCGCACTGAGGCTGCTGTTTCGAGGAAAGGGCTCAGAGCTGGTGAAGACCGTCAAGTACCCTCTGGTTTCGCCCGTCCAGGGTGCTGTGGAAGAACTCGCCGAAGCAATACGCCACGACAACCTACATTTATTTGGGCAGAAGGAAATCGTAGATCTGATGGAGAAGGACGAGGGCACCCAGGTCTACTCGGTGGTGGACTTTTGGCTCGACACACTCCGTCTAGGGATGTTCTTCAGTCCAAGTGCTAATGCCCTTAAGATCACTCTGGGGAAGTTTAACAGCGACCAAGTTTCCCCTTTCAGGAAGGTTCTGGAGCAGTCCCCTTTCTTTCTCGCGGGTAGACTCAAAGTGGAGCCCGCTGAACGTATCCTCAGCGTGGAGATCCGCAAGATCGGTAAGAGGGAGAATAGAGTGGAGAACTACGCCGCAGATGTAGAGACTTGTTTTATCGGTCAGCTGTCTAGTGATGAAAAGCAGTCTATCCAGAAGCTCGCTAACGATATCTGGGACTCTAAGGATCACGAAGAGCAAAGGATGCTTAAGGCGGATTTCTTTGCCATTCCCCTCATCAAAGACCCAAAGGCAGTGACCGAGGAAGATCCCGAGAATGAAACCGCAGGCAAACAGAAGCCTCTCGAATTATGTGTGTGCTTAGTGCCCGAGTTGTACACCCGCGGGTTCGGTTCAATAGCGGACTTCCTGGTCCAGCGTCTGACACTATTAAGAGACAAAATGAGCACAGACACAGCAGAAGACTGCCTTGAGTATGTCGGCATAGAGGAGGAGAAGGGTAATGGGATGAACTCGCTGCTGGGGACGTTCCTCAAGAACCTGCAGGGAGACGGGTTCGAACAGATCTTCCAATTTATGCTCGGCAGTTACGTGGGATGGCAAGGTAAGGAAGACGTCCTACGCGAACGGCTTGATTTGCTAGCGGAGAAGGTTAAAAGACTGCCGAAACCTAAGTTTGCCGGCGAGTGGTCCGGCCATCGGATGTTCCTGCATGGTCAATTGAAGAGCTGGTCCTCTAACTTTTTCCGCCTGTTTAACGAGACTAGGGAGCTCCTCGAAAGCATAAAATCCGACATCCAACACGCGACCATGTTAATCAGCTACGTCGAAGAGAAAGGGGGATACCACCCACAACTCTTGTCACAGTACAGGAAACTAATGGAGCAGCTGCCAGCTCTCAGAACAAAGGTGTTAGATCCAGAGATAGAAATGACTCACATGAGCGAGGCGGTAAGGTCGTACATTATGATCCACAAGTCGGTAGCAGGATTTCTGCCTGACTTACTCGAGTCCCTCGATAGGGACAAGGACAGGGAATTCCTGCTGAGTATATTTCCAAGGATCCCCAAAATTGACAAAAAAACTAAGGAAATCGTGGCCTGGGAGCTCCCAGGCGAGCCCGAAGAAGGATACCTGTTCACTGCCAATAATCTTTTTCGCAACTTTCTGGAGAATCCTAAACATGTTCCACGTTTCATGGCAGAAAGGATCCCGGAAGATTGGACGCGCCTGCGGTCCGCTCCCGTATGGTTTGACGGCATGGTGAAACAATGGCAGAAAGTGGTAAACCAGCTGGTGGAGTCACCTGGAGCATTGTATCAGTTCAATGAAAGCTTTCTCCGACAACGTTTACAGGCAATGCTGACAGTGTATAAGAGAGACCTGCAGACAGAGAAATTCCTTAAGTTGTTGGCTGATGTCTGCAGGCCTCTGGTGGACTTCTTTGGGCTGGGGGGAAACGATATCATCTTCAAAAGCTGCCAGGACCCGAGGAAACAATGGCAAACTGTCATTCCCTTGAGTGTCCCCGCTGATGTGTACACCGCGTGTGAGGGGCTGGCAATCCGGCTTCGTGAGACATTGGGATTTGAGTGGAAGAACCTTAAGGGCCATGAAAGGGAGGACTTTCTAAGACTGCACCAGCTTTTAGGGAATCTGCTTTTCTGGATTCGAGATGCCAAACTGGTGGTGAAATTGGAAGATTGGATGAATAATCCCTGTGTTCAGGAGTACGTTGAGGCTCGTAAGGCCATTGATCTCCCACTGGAGATCTTCGGCTTTGAGGTCCCCATCTTCCTGAACGGATATCT
GTTTAGTGAACTGAGGCAGTTAGAACTGCTGCTCCGCCGTAAGTCGGTTATGACCAGCTATTCGGTTAAGACAACTGGCAGTCCAAACAGGCTTTTCCAGTTAGTCTACCTGCCATTAAATCCTTCCGACCCTGAGAAAAAAAATTCTAATAACTTTCAGGAACGCCTGGACACCCCCACTGGCTTATCACGTCGCTTCCTGGACCTTACTCTGGACGCCTTCGCCGGCAAGTTGCTGACAGACCCCGTGACTCAAGAGCTTAAAACTATGGCTGGGTTCTACGATCACCTGTTTGGTTTCAAGCTCCCATGTAAGCTGGCAGCCATGTCTAACCACCCTGGCTCTAGCAGCAAGATGGTCGTGTTGGCCAAACCTAAAAAAGGGGTTGCATCTAATATAGGATTCGAACCAATCCCTGATCCCGCGCACCCCGTATTCCGGGTGAGATCATCATGGCCAGAGCTGAAGTATCTGGAGGGGTTACTGTATCTTCCAGAAGACACTCCACTGACAATAGAGCTCGCAGAGACAAGTGTTAGTTGTCAGAGCGTCAGTAGCGTGGCATTCGATCTGAAAAATCTGACTACTATCCTTGGACGCGTGGGTGAGTTCCGTGTGACCGCAGACCAGCCTTTTAAGTTGACCCCCATCATCCCTGAGAAGGAGGAGTCCTTCATAGGAAAAACATATCTAGGCCTTGATGCCGGGGAACGCTCAGGCGTAGGGTTCGCTATCGTCACAGTCGACGGGGATGGGTACGAGGTACAGCGCCTGGGGGTGCATGAAGATACACAGCTGATGGCCCTACAGCAGGTGGCCTCTAAAAGCTTGAAGGAGCCGGTGTTCCAGCCGCTCAGAAAGGGTACTTTTCGGCAGCAGGAACGTATTAGAAAATCTCTCAGAGGATGTTATTGGAACTTCTATCACGCTCTGATGATTAAGTACCGCGCCAAGGTAGTGCACGAAGAGAGCGTGGGCAGTTCCGGCCTGGTTGGGCAGTGGTTACGAGCATTCCAGAAGGACCTCAAGAAAGCCGATGTGTTGCCAAAAAAGGGAGGCAAAAACGGAGTCGATAAGAAAAAGAGAGAGTCTTCTGCACAAGACACATTGTGGGGAGGGGCTTTTAGCAAGAAGGAAGAACAGCAGATAGCTTTCGAAGTCCAAGCTGCTGGTTCTAGCCAGTTCTGCCTGAAGTGCGGATGGTGGTTCCAACTCGGAATGCGTGAGGTTAATCGCGTGCAGGAATCCGGCGTCGTGCTGGATTGGAATCGGAGTATTGTCACATTCCTGATTGAGAGCTCTGGCGAGAAAGTGTATGGGTTCTCCCCTCAGCAACTCGAAAAGGGGTTCAGACCAGACATTGAAACCTTCAAGAAGATGGTTCGGGATTTCATGCGCCCGCCTATGTTTGACCGGAAGGGTCGCCCAGCAGCTGCCTACGAAAGGTTTGTCTTGGGACGCCGGCATCGGCGGTATAGATTCGACAAGGTTTTTGAAGAACGATTCGGACGATCCGCGCTATTCATTTGCCCGAGGGTTGGCTGTGGCAACTTTGACCACAGCAGCGAGCAGTCAGCCGTAGTGCTGGCTCTAATCGGATATATTGCCGACAAAGAGGGGATGAGCGGAAAAAAGCTAGTCTACGTGCGTCTGGCAGAACTAATGGCGGAATGGAAATTGAAGAAACTGGAGAGGAGTAGAGTTGAGGAGCAAAGCTCCGCTCAGTGA
SEQ ID NO: 164
ATGGCGGAGTCGAAGCAAATGCAGTGCAGGAAGTGTGGAGCCTCTATGAAGTACGAAGTGATCGGCCTCGGGAAGAAAAGCTGCAGATATATGTGTCCCGACTGCGGGAATCACACATCTGCAAGAAAGATTCAGAATAAGAAGAAAAGGGACAAGAAGTATGGATCTGCCAGTAAAGCACAAAGCCAACGAATCGCAGTTGCAGGGGCCTTATACCCGGATAAAAAGGTTCAGACCATCAAGACTTATAAGTATCCAGCCGACCTGAATGGTGAGGTCCATGACTCAGGGGTGGCCGAAAAAATAGCCCAAGCAATCCAGGAGGATGAAATAGGGCTCCTCGGCCCCTCTTCCGAGTACGCCTGTTGGATCGCTAGCCAGAAACAGAGCGAGCCCTACAGTGTTGTAGACTTTTGGTTTGACGCTGTGTGCGCCGGAGGCGTGTTCGCCTATTCTGGGGCTAGATTGCTGTCTACCGTCCTGCAGCTATCTGGGGAGGAGAGCGTCCTACGCGCAGCCCTGGCATCCTCCCCTTTTGTCGACGATATCAATCTGGCACAGGCCGAAAAATTTCTGGCGGTGTCCAGGCGAACCGGCCAAGATAAGCTGGGGAAGCGCATTGGAGAGTGCTTCGCAGAGGGCCGACTTGAGGCCCTAGGCATCAAGGACCGGATGCGTGAATTTGTCCAGGCTATCGATGTCGCTCAGACCGCTGGGCAGCGTTTTGCCGCGAAACTGAAAATCTTTGGGATTTCTCAGATGCCCGAGGCAAAGCAGTGGAACAATGACAGCGGACTCACCGTGTGCATCCTGCCCGACTATTACGTCCCAGAAGAAAATCGCGCAGATCAGTTGGTCGTCCTGCTAAGACGACTGAGAGAGATAGCATACTGTATGGGGATCGAAGATGAGGCCGGTTTTGAACATCTTGGAATTGATCCTGGCGCACTATCAAATTTTTCCAATGGCAATCCTAAACGCGGATTTTTGGGCCGCCTGCTGAACAATGATATTATTGCCTTAGCGAACAACATGTCCGCCATGACGCCTTACTGGGAGGGCAGGAAGGGAGAACTGATTGAAAGATTGGCTTGGCTGAAGCACCGTGCAGAGGGGCTTTATCTGAAGGAACCGCATTTTGGAAATAGTTGGGCCGACCATAGGTCTAGAATTTTTTCCAGAATAGCCGGGTGGCTTTCTGGGTGCGCTGGGAAGCTAAAGATCGCCAAAGACCAGATCAGCGGAGTGCGTACTGATCTGTTCCTTCTGAAGAGACTGCTGGATGCGGTCCCGCAGTCCGCCCCTTCTCCCGACTTCATAGCCTCTATCTCTGCCTTGGATCGCTTCCTGGAGGCCGCAGAATCTAGTCAGGATCCTGCCGAACAGGTGAGGGCCCTATACGCCTTTCATCTGAACGCACCCGCGGTGCGAAGCATCGCCAACAAGGCAGTCCAGCGATCCGACAGCCAAGAATGGCTTATAAAGGAACTGGACGCTGTGGACCACCTGGAGTTTAACAAGGCCTTTCCCTTCTTCTCTGATACGGGAAAGAAGAAAAAGAAAGGGGCTAACTCGAATGGCGCTCCGTCCGAGGAGGAGTACACCGAGACTGAGAGCATCCAGCAGCCCGAGGACGCTGAGCAAGAGGTTAATGGTCAGGAAGGCAACGGGGCCTCGAAGAACCAGAAGAAGTTTCAGAGAATCCCCCGATTCTTCGGCGAGGGGAGTCGCAGCGAGTATCGCATCCTCACTGAAGCCCCGCAGTACTTCGACATGTTCTGTAACAACATGCGGGCCATCTTTATGCAATTAGAATCCCAACCGCGTAAAGCTCCCAGGGATTTTAAGTGTTTCCTGCAGAATCGGCTGCAGAAATTGTATAAGCAGACATTCCTGAACGCTCGATCCAACAAGTGCCGGGCATTACTAGAGTCCGTATTGATTAGTTGGGGAGAGTTTTACACCTACGGGGCTAACGAGAAAAAATTTCGACTGCGTCATGAAGCTTCTGAGCGCTCCTCGGACCCAGATTACGTGGTGCAACAGGCGCTGGAGATCGCTCGGAGGCTGTTTCTCTTCGGCTTTGAGTGGAGGGACTGTAGCGCAGGTGAAAGAGTGGATCTGGTCGAAATACATAAGAAAGCCATATCTTTCCTGTTGGCCATCACTCAGGCTGAGGTGTCTGTGGGCAGCTATAACTG
GCTGGGCAATTCTACCGTGAGTCGGTACCTGTCCGTGGCAGGGACTGATACCCTTTACGGCACCCAGCTGGAAGAATTCTTAAATGCAACCGTGTTATCTCAGATGCGGGGGCTGGCTATCAGGTTATCATCTCAGGAACTGAAGGATGGATTTGACGTACAGCTGGAGTCTAGTTGCCAGGATAATCTGCAACACTTGCTCGTGTACAGGGCTTCACGAGACCTTGCCGCCTGCAAGCGCGCTACTTGTCCAGCTGAGTTGGATCCTAAGATTCTGGTACTGCCCGTGGGGGCCTTTATCGCTAGCGTGATGAAAATGATTGAAAGAGGGGATGAGCCTTTAGCTGGAGCTTATCTGAGACACAGACCCCATAGTTTCGGGTGGCAGATCCGCGTTCGAGGTGTGGCAGAGGTGGGAATGGACCAAGGGACCGCCCTGGCGTTCCAGAAACCGACCGAGAGCGAACCCTTCAAGATAAAGCCGTTTTCCGCTCAATACGGCCCCGTTCTATGGCTGAACAGCTCCAGTTATAGCCAGAGCCAGTACCTGGACGGGTTCCTATCACAGCCCAAGAACTGGAGTATGCGGGTGCTGCCACAGGCCGGCTCAGTGCGGGTAGAACAGCGCGTCGCCTTGATTTGGAATCTCCAGGCCGGAAAGATGAGGCTGGAACGGAGCGGAGCGCGGGCTTTCTTCATGCCCGTCCCATTCAGTTTCCGCCCCAGTGGCAGCGGCGACGAGGCAGTCCTGGCTCCAAATAGGTACCTGGGACTCTTTCCACACAGCGGCGGCATAGAGTACGCTGTGGTCGATGTTCTTGACTCTGCCGGCTTCAAAATACTCGAGAGAGGAACAATAGCCGTCAATGGCTTCTCCCAGAAACGAGGAGAAAGACAAGAGGAAGCCCATCGCGAAAAACAAAGACGCGGTATCTCCGATATTGGGCGCAAGAAGCCAGTCCAGGCCGAAGTCGATGCGGCCAACGAGCTCCATCGAAAATACACCGATGTTGCTACTCGGCTGGGGTGTCGAATTGTCGTTCAATGGGCACCCCAACCCAAACCAGGCACTGCGCCGACCGCTCAGACTGTGTACGCTAGGGCCGTGAGGACTGAAGCACCAAGATCCGGCAATCAGGAAGATCACGCCAGGATGAAATCTTCCTGGGGATACACATGGGGTACGTATTGGGAAAAAAGGAAGCCCGAGGACATCCTCGGCATTAGTACCCAGGTGTATTGGACAGGCGGGATCGGCGAGTCCTGCCCGGCTGTCGCCGTCGCGCTATTGGGACACATCAGGGCCACCTCAACCCAGACTGAATGGGAGAAAGAGGAAGTCGTGTTTGGGCGATTGAAAAAGTTCTTCCCATCCTGA
SEQ ID NO: 165
ATGGAGAAGCGCATCAATAAAATTCGCAAGAAGCTGTCTGCCGATAACGCCACAAAACCAGTTAGTCGAAGCGGCCCAATGAAGACCCTGCTAGTTCGAGTGATGACTGATGATCTGAAGAAAAGGCTCGAAAAGCGACGCAAGAAGCCTGAGGTAATGCCTCAGGTTATAAGTAACAATGCAGCAAACAATCTGCGGATGCTGCTTGACGATTACACAAAGATGAAGGAAGCCATTCTCCAGGTGTATTGGCAGGAGTTCAAGGATGATCACGTAGGCCTGATGTGTAAATTCGCGCAACCTGCAAGCAAGAAGATCGACCAAAACAAGCTGAAACCCGAGATGGATGAAAAAGGCAATTTAACAACCGCCGGATTCGCTTGTTCCCAGTGTGGGCAGCCACTGTTCGTGTACAAGTTAGAACAGGTGTCGGAAAAAGGAAAGGCATACACTAACTACTTTGGACGGTGCAATGTTGCAGAACACGAAAAGCTGATACTGCTTGCCCAGCTTAAGCCCGAAAAAGACAGCGACGAAGCGGTGACCTACAGCCTGGGAAAATTCGGGCAGCGGGCACTGGACTTCTATTCTATCCACGTTACCAAGGAGAGCACCCACCCAGTGAAGCCGTTGGCCCAAATCGCTGGAAACCGGTACGCCAGCGGACCAGTCGGCAAGGCCCTGTCCGATGCCTGTATGGGCACAATTGCTTCTTTCCTGTCCAAGTACCAGGACATCATAATCGAGCACCAAAAAGTTGTGAAAGGGAATCAGAAACGCCTGGAATCCCTTCGAGAACTGGCCGGCAAGGAGAACCTTGAGTACCCGTCCGTGACCCTGCCTCCACAGCCACATACCAAAGAGGGCGTAGACGCGTATAATGAGGTCATTGCCCGCGTTCGCATGTGGGTTAATTTAAACCTGTGGCAGAAATTAAAACTAAGCCGAGATGATGCTAAACCGTTACTGAGATTGAAGGGATTCCCTAGCTTTCCTGTGGTGGAGAGAAGGGAAAACGAGGTTGATTGGTGGAATACTATTAATGAGGTGAAAAAGCTTATTGACGCCAAGAGGGATATGGGCAGGGTGTTCTGGAGCGGGGTGACTGCCGAAAAGAGAAATACCATCCTCGAGGGATACAATTACCTCCCCAACGAGAATGATCATAAGAAAAGAGAGGGGAGCTTAGAGAATCCAAAGAAACCTGCAAAGAGGCAATTCGGTGATCTCCTGCTCTACCTCGAGAAGAAATACGCGGGGGACTGGGGAAAAGTTTTTGACGAAGCCTGGGAGCGCATTGACAAGAAGATCGCCGGGCTGACGTCTCACATTGAACGGGAAGAGGCACGGAATGCAGAGGACGCCCAGTCTAAGGCCGTGCTGACTGACTGGCTGCGCGCAAAGGCCTCCTTCGTGCTCGAACGTCTGAAGGAAATGGATGAGAAAGAGTTTTACGCGTGTGAAATACAGCTGCAGAAGTGGTACGGCGATCTAAGGGGAAATCCCTTCGCAGTGGAAGCCGAGAATAGGGTAGTTGACATCAGTGGGTTCTCCATCGGCAGTGATGGACATTCTATCCAGTATAGAAACCTGCTCGCCTGGAAGTACTTAGAGAACGGCAAGAGAGAGTTCTATCTGCTGATGAACTACGGGAAAAAAGGTAGAATTCGCTTTACAGATGGCACCGACATAAAGAAGTCCGGAAAGTGGCAAGGCCTCTTATACGGAGGCGGCAAAGCAAAGGTGATAGACTTGACTTTTGACCCTGACGACGAACAGCTGATAATCTTGCCGCTGGCCTTTGGCACAAGACAAGGTAGGGAATTTATCTGGAATGATCTTCTTTCTCTCGAGACCGGACTCATCAAGCTCGCAAACGGAAGGGTCATCGAGAAGACAATCTACAATAAAAAGATAGGCCGAGACGAGCCAGCCCTGTTTGTGGCTTTGACATTTGAGCGGAGAGAGGTCGTAGATCCCAGCAACATCAAACCCGTGAACCTGATCGGTGTTGACAGGGGCGAGAACATCCCGGCGGTTATCGCACTGACGGATCCAGAAGGATGTCCTCTGCCCGAGTTCAAAGATTCATCGGGAGGGCCAACCGACATTTTGAGGATAGGGGAGGGGT
ACAAGGAGAAGCAGCGAGCTATCCAGGCGGCCAAAGAAGTGGAGCAACGAAGAGCTGGTGGTTATTCTCGCAAGTTCGCTTCCAAAAGTCGTAACCTGGCTGACGATATGGTGCGCAATTCTGCCCGTGACCTTTTCTACCACGCCGTTACACACGACGCCGTGTTAGTGTTTGAAAATCTTAGTCGAGGCTTCGGGCGACAGGGGAAGCGGACCTTTATGACCGAGAGACAGTATACAAAAATGGAGGATTGGCTGACCGCCAAACTGGCGTATGAAGGACTCACATCCAAGACCTATCTCTCAAAAACTTTGGCCCAGTATACATCTAAGACGTGCAGTAACTGTGGCTTCACCATTACCACAGCTGACTACGATGGCATGCTGGTCCGCTTAAAAAAGACATCTGACGGCTGGGCTACTACCCTCAACAATAAAGAGCTCAAAGCCGAAGGACAAATTACCTATTATAACAGGTATAAAAGACAGACTGTCGAGAAGGAGTTGAGCGCGGAGCTGGACCGCCTATCAGAGGAGTCAGGGAACAACGATATCTCTAAGTGGACTAAGGGACGCCGAGACGAGGCGTTGTTCTTGCTGAAAAAGCGGTTCTCTCATCGACCCGTGCAGGAGCAGTTCGTGTGTCTGGACTGCGGCCACGAGGTTCATGCTGATGAGCAAGCTGCTCTAAATATTGCCCGTAGTTGGTTGTTCCTGAACAGCAATTCAACAGAGTTCAAGTCATACAAGAGCGGAAAGCAGCCGTTTGTGGGCGCATGGCAGGCATTTTACAAAAGACGCCTGAAGGAAGTGTGGAAGCCAAACGCC
SEQ ID NO: 166
ATGAAAAGGATTAACAAAATCCGAAGGCGGCTTGTAAAGGATTCTAACACCAAAAAGGCTGGCAAGACGGGGCCCATGAAAACATTACTCGTTAGAGTTATGACCCCCGACCTCAGAGAGCGACTGGAAAATTTACGCAAGAAGCCAGAGAACATACCTCAGCCAATTAGTAATACCTCTCGGGCAAACCTAAACAAGTTGCTTACTGATTACACGGAGATGAAAAAGGCCATACTGCATGTGTACTGGGAGGAGTTTCAAAAGGACCCTGTCGGGCTAATGAGCAGGGTGGCTCAGCCTGCACCTAAAAACATCGACCAGCGGAAACTCATCCCAGTTAAGGACGGAAATGAGAGATTGACAAGTTCAGGTTTCGCCTGCTCACAGTGCTGTCAACCGCTGTACGTTTATAAGTTAGAACAAGTGAATGACAAAGGAAAGCCTCACACAAATTATTTTGGCCGGTGTAATGTCTCTGAGCATGAGCGTCTGATTCTGTTGTCCCCGCATAAACCGGAAGCTAATGACGAGCTCGTAACCTACAGCTTGGGGAAGTTTGGCCAAAGAGCATTGGACTTCTATTCAATCCATGTGACCCGCGAATCCAATCATCCCGTCAAGCCCTTGGAGCAGATAGGGGGCAATAGTTGCGCTTCTGGCCCTGTGGGCAAAGCCCTGTCCGACGCCTGTATGGGAGCCGTGGCTTCATTCCTGACCAAATATCAGGATATCATCTTGGAGCACCAGAAAGTGATCAAGAAAAATGAAAAAAGGTTAGCAAACCTCAAGGATATTGCAAGCGCTAACGGCTTGGCTTTTCCTAAAATCACACTTCCACCTCAGCCTCACACAAAGGAAGGCATCGAGGCATACAACAATGTGGTGGCCCAGATCGTCATCTGGGTTAACTTAAACCTGTGGCAGAAACTTAAAATTGGCAGGGATGAGGCAAAACCCTTACAGCGCCTGAAAGGATTCCCCAGCTTTCCACTGGTGGAGCGCCAGGCTAACGAAGTGGACTGGTGGGATATGGTGTGTAACGTCAAGAAGCTCATCAATGAAAAGAAAGAGGACGGTAAAGTCTTCTGGCAGAACCTCGCCGGTTACAAACGGCAGGAGGCGCTGTTACCTTATCTGTCGAGTGAAGAGGACCGGAAAAA
AGGCAAGAAATTTGCTCGTTATCAGTTTGGTGATTTGCTCCTACATTTGGAGAAGAAGCACGGCGAGGACTGGGGAAAAGTATACGATGAGGCCTGGGAGAGGATTGACAAAAAGGTGGAGGGACTGTCAAAGCACATCAAGCTCGAAGAAGAGCGCAGAAGCGAGGACGCCCAATCCAAAGCAGCGCTGACTGACTGGCTGCGGGCGAAGGCCAGTTTTGTAATCGAAGGCCTTAAAGAAGCCGACAAGGATGAATTCTGCAGATGCGAATTAAAACTCCAGAAGTGGTACGGCGATCTCCGAGGTAAGCCTTTCGCAATCGAGGCCGAGAATTCCATACTGGACATTAGTGGATTCAGTAAACAGTATAATTGTGCCTTTATATGGCAGAAGGATGGTGTCAAGAAACTCAACCTGTACCTTATTATTAATTATTTCAAAGGCGGGAAACTGAGATTTAAGAAGATAAAGCCTGAAGCCTTTGAGGCGAACCGATTCTACACAGTTATTAACAAGAAATCTGGTGAAATTGTACCCATGGAGGTAAACTTCAACTTCGATGATCCCAATCTGATTATATTGCCACTAGCTTTTGGCAAGCGGCAGGGTAGGGAATTCATTTGGAACGATTTGCTTTCACTGGAAACAGGGTCCCTTAAGCTGGCAAACGGGAGAGTGATTGAAAAGACATTGTACAATCGGAGGACACGTCAGGATGAACCTGCCCTTTTCGTGGCTCTGACATTCGAGCGCAGGGAGGTTCTGGACTCTAGCAATATCAAGCCAATGAACCTGATCGGCATAGACCGAGGAGAGAATATTCCGGCTGTGATCGCACTCACCGATCCCGAAGGATGTCCCCTTTCTCGGTTCAAGGACTCCTTAGGCAATCCAACTCATATCCTGAGAATCGGCGAGTCATACAAGGAGAAGCAGCGAACAATTCAGGCCGCCAAGGAAGTCGAGCAGAGGCGAGCTGGCGGCTACAGCCGTAAATACGCTAGTAAAGCTAAGAACCTGGCCGACGATATGGTGCGCAATACTGCTAGAGACCTGCTGTACTATGCAGTGACGCAGGACGCAATGCTGATATTCGAGAATCTGTCCAGAGGATTCGGAAGGCAGGGCAAGCGGACGTTCATGGCCGAGCGCCAGTATACAAGGATGGAGGATTGGTTAACGGCCAAGCTTGCCTATGAGGGGCTACCTAGTAAGACCTATCTGTCTAAGACGCTGGCTCAATACACCAGTAAGACCTGCTCAAACTGTGGCTTTACAATCACTTCTGCTGATTATGATAGAGTGCTCGAGAAGCTAAAAAAAACTGCCACCGGCTGGATGACTACTATTAATGGGAAGGAACTGAAAGTGGAAGGACAGATTACCTATTATAATCGCTACAAGCGTCAAAACGTCGTCAAGGACCTGTCGGTGGAATTGGACAGACTCAGTGAAGAGTCCGTGAACAATGATATCAGCTCCTGGACAAAAGGGCGCAGTGGGGAGGCACTCAGCTTGCTTAAAAAGAGGTTTTCACATCGGCCGGTCCAGGAGAAATTTGTCTGCCTGAACTGCGGATTCGAGACACACGCCGACGAGCAGGCAGCACTGAACATTGCCAGATCCTGGCTGTTCCTTAGGTCCCAGGAATATAAGAAGTACCAGACTAACAAAACCACGGGAAACACAGATAAAAGGGCCTTTGTCGAAACTTGGCAATCCTTTTACCGGAAGAAGTTAAAGGAAGTGTGGAAGCCC
SEQ ID NO: 167
ATGGATAAGAAATACTCAATAGGCTTAGCAATCGGCACAAATAGCGTCGGATGGGCGGTGATCACTGATGAATATAAGGTTCCGTCTAAAAAGTTCAAGGTTCTGGGAAATACAGACCGCCACAGTATCAAAAAAAATCTTATAGGGGCTCTTTTATTTGACAGTGGAGAGACAGCGGAAGCGACTCGTCTCAAACGGACAGCTCGTAGAAGGTATACACGTCGGAAGAATCGTATTTGTTATCTACAGGAGATTTTTTCAAATGAGATGGCGAAAGTAGATGATAGTTTCTTTCATCGACTTGAAGAGTCTTTTTTGGTGGAAGAAGACAAGAAGCATGAACGTCATCCTATTTTTGGAAATATAGTAGATGAAGTTGCTTATCATGAGAAATATCCAACTATCTATCATCTGCGAAAAAAATTGGTAGATTCTACTGATAAAGCGGATTTGCGCTTAATCTATTTGGCCTTAGCGCATATGATTAAGTTTCGTGGTCATTTTTTGATTGAGGGAGATTTAAATCCTGATAATAGTGATGTGGACAAACTATTTATCCAGTTGGTACAAACCTACAATCAATTATTTGAAGAAAACCCTATTAACGCAAGTGGAGTAGATGCTAAAGCGATTCTTTCTGCACGATTGAGTAAATCAAGACGATTAGAAAATCTCATTGCTCAGCTCCCCGGTGAGAAGAAAAATGGCTTATTTGGGAATCTCATTGCTTTGTCATTGGGTTTGACCCCTAATTTTAAATCAAATTTTGATTTGGCAGAAGATGCTAAATTACAGCTTTCAAAAGATACTTACGATGATGATTTAGATAATTTATTGGCGCAAATTGGAGATCAATATGCTGATTTGTTTTTGGCAGCTAAGAATTTATCAGATGCTATTTTACTTTCAGATATCCTAAGAGTAAATACTGAAATAACTAAGGCTCCCCTATCAGCTTCAATGATTAAACGCTACGATGAACATCATCAAGACTTGACTCTTTTAAAAGCTTTAGTTCGACAACAACTTCCAGAAAAGTATAAAGAAATCTTTTTTGATCAATCAAAAAACGGATATGCAGGTTATATTGATGGGGGAGCTAGCCAAGAAGAATTTTATAAATTTATCAAACCAATTTTAGAAAAAATGGATGGTACTGAGGAATTATTGGTGAAACTAAATCGTGAAGATTTGCTGCGCAAGCAACGGACCTTTGACAACGGCTCTATTCCCCATCAAATTCACTTGGGTGAGCTGCATGCTATTTTGAGAAGACAAGAAGACTTTTATCCATTTTTAAAAGACAATCGTGAGAAGATTGAAAAAATCTTGACTTTTCGAATTCCTTATTATGTTGGTCCATTGGCGCGTGGCAATAGTCGTTTTGCATGGATGACTCGGAAGTCTGAAGAAACAATTACCCCATGGAATTTTGAAGAAGTTGTCGATAAAGGTGCTTCAGCTCAATCATTTATTGAACGCATGACAAACTTTGATAAAAATCTTCCAAATGAAAAAGTACTACCAAAACATAGTTTGCTTTATGAGTATTTTACGGTTTATAACGAATTGACAAAGGTCAAATATGTTACTGAAGGAATGCGAAAACCAGCATTTCTTTCAGGTGAACAGAAGAAAGCCATTGTTGATTTACTCTTCAAAACAAATCGAAAAGTAACCGTTAAGCAATTAAAAGAAGATTATTTCAAAAAAATAGAATGTTTTGATAGTGTTGAAATTTCAGGAGTTGAAGATAGATTTAATGCTTCATTAGGTACCTACCATGATTTGCTAAAAATTATTAAAGATAAAGATTTTTTGGATAATGAAGAAAATGAAGATATCTTAGAGGATATTGTTTTAACATTGACCTTATTTGAAGATAGGGAGATGATTGAGGAAAGACTTAAAACATATGCTCACCTCTTTGATGATAAGGTGATGAAACAGCTTAAACGTCGCCGTTATACTGGTTGGGGACGTTTGTCTCGAAAATTGATTAATGGTATTAGGGATAAGCAATCTGGCAAAACAATATTAGATTTTTTGAAATCAGATGGT
TTTGCCAATCGCAATTTTATGCAGCTGATCCATGATGATAGTTTGACATTTAAAGAAGACATTCAAAAAGCACAAGTGTCTGGACAAGGCGATAGTTTACATGAACATATTGCAAATTTAGCTGGTAGCCCTGCTATTAAAAAAGGTATTTTACAGACTGTAAAAGTTGTTGATGAATTGGTCAAAGTAATGGGGCGGCATAAGCCAGAAAATATCGTTATTGAAATGGCACGTGAAAATCAGACAACTCAAAAGGGCCAGAAAAATTCGCGAGAGCGTATGAAACGAATCGAAGAAGGTATCAAAGAATTAGGAAGTCAGATTCTTAAAGAGCATCCTGTTGAAAATACTCAATTGCAAAATGAAAAGCTCTATCTCTATTATCTCCAAAATGGAAGAGACATGTATGTGGACCAAGAATTAGATATTAATCGTTTAAGTGATTATGATGTCGATCACATTGTTCCACAAAGTTTCCTTAAAGACGATTCAATAGACAATAAGGTCTTAACGCGTTCTGATAAAAATCGTGGTAAATCGGATAACGTTCCAAGTGAAGAAGTAGTCAAAAAGATGAAAAACTATTGGAGACAACTTCTAAACGCCAAGTTAATCACTCAACGTAAGTTTGATAATTTAACGAAAGCTGAACGTGGAGGTTTGAGTGAACTTGATAAAGCTGGTTTTATCAAACGCCAATTGGTTGAAACTCGCCAAATCACTAAGCATGTGGCACAAATTTTGGATAGTCGCATGAATACTAAATACGATGAAAATGATAAACTTATTCGAGAGGTTAAAGTGATTACCTTAAAATCTAAATTAGTTTCTGACTTCCGAAAAGATTTCCAATTCTATAAAGTACGTGAGATTAACAATTACCATCATGCCCATGATGCGTATCTAAATGCCGTCGTTGGAACTGCTTTGATTAAGAAATATCCAAAACTTGAATCGGAGTTTGTCTATGGTGATTATAAAGTTTATGATGTTCGTAAAATGATTGCTAAGTCTGAGCAAGAAATAGGCAAAGCAACCGCAAAATATTTCTTTTACTCTAATATCATGAACTTCTTCAAAACAGAAATTACACTTGCAAATGGAGAGATTCGCAAACGCCCTCTAATCGAAACTAATGGGGAAACTGGAGAAATTGTCTGGGATAAAGGGCGAGATTTTGCCACAGTGCGCAAAGTATTGTCCATGCCCCAAGTCAATATTGTCAAGAAAACAGAAGTACAGACAGGCGGATTCTCCAAGGAGTCAATTTTACCAAAAAGAAATTCGGACAAGCTTATTGCTCGTAAAAAAGACTGGGATCCAAAAAAATATGGTGGTTTTGATAGTCCAACGGTAGCTTATTCAGTCCTAGTGGTTGCTAAGGTGGAAAAAGGGAAATCGAAGAAGTTAAAATCCGTTAAAGAGTTACTAGGGATCACAATTATGGAAAGAAGTTCCTTTGAAAAAAATCCGATTGACTTTTTAGAAGCTAAAGGATATAAGGAAGTTAAAAAAGACTTAATCATTAAACTACCTAAATATAGTCTTTTTGAGTTAGAAAACGGTCGTAAACGGATGCTGGCTAGTGCCGGAGAATTACAAAAAGGAAATGAGCTGGCTCTGCCAAGCAAATATGTGAATTTTTTATATTTAGCTAGTCATTATGAAAAGTTGAAGGGTAGTCCAGAAGATAACGAACAAAAACAATTGTTTGTGGAGCAGCATAAGCATTATTTAGATGAGATTATTGAGCAAATCAGTGAATTTTCTAAGCGTGTTATTTTAGCAGATGCCAATTTAGATAAAGTTCTTAGTGCATATAACAAACATAGAGACAAACCAATACGTGAACAAGCAGAAAATATTATTCATTTATTTACGTTGACGAATCTTGGAGCTCCCGCTGCTTTTAAATATTTTGATACAACAATTGATCGTAAACGATATACGTCTACAAAAGAAGTTTTAGATGCCACTCTTATCCATCAATCCATCACTGG
TCTTTATGAAACACGCATTGATTTGAGTCAGCTAGGAGGTGACTGA
SEQ ID NO: 168
ATGGATAAGAAGTATTCAATTGGACTTGCGATTGGCACTAACAGTGTGGGCTGGGCGGTGATTACAGACGAGTATAAGGTGCCGTCAAAAAAGTTTAAAGTTCTGGGCAACACTGATCGCCATTCCATCAAGAAAAACCTAATCGGGGCCCTTCTTTTTGATAGTGGCGAAACGGCCGAGGCGACGCGTCTAAAACGTACCGCGCGGCGTCGCTACACCCGACGAAAAAACCGTATTTGTTACCTTCAGGAGATCTTCAGTAACGAAATGGCTAAGGTGGACGATTCATTCTTCCACCGTCTGGAGGAGTCCTTTTTAGTTGAAGAAGACAAGAAGCATGAGCGACACCCAATTTTTGGTAACATTGTCGACGAAGTCGCCTATCACGAAAAATATCCGACCATTTATCACCTGCGCAAAAAACTGGTCGATAGCACGGATAAAGCGGATCTGCGGCTTATTTACCTGGCGCTTGCCCACATGATCAAGTTCCGCGGCCACTTCCTGATAGAAGGAGACCTGAACCCGGATAATAGCGATGTAGACAAACTGTTTATTCAGCTGGTCCAGACCTACAACCAGCTGTTTGAAGAAAATCCGATTAATGCGTCAGGCGTGGATGCGAAAGCGATACTGAGTGCCCGCCTGTCGAAATCTCGCCGTCTCGAAAATCTGATTGCACAGCTGCCCGGCGAAAAAAAAAACGGTCTTTTTGGCAATCTGATCGCGCTGTCACTGGGCCTGACACCAAATTTTAAGAGCAACTTCGACCTGGCAGAGGATGCGAAGCTTCAACTGTCGAAGGACACCTATGACGATGATCTGGATAATCTTCTGGCACAAATCGGTGATCAGTATGCGGATTTATTCCTTGCAGCGAAAAACCTATCTGACGCAATTCTGTTGAGCGATATCCTCCGCGTCAACACCGAAATCACTAAAGCCCCCCTGTCAGCGTCGATGATTAAACGTTATGATGAGCACCATCAGGATCTGACCTTGCTAAAGGCGCTGGTGCGACAGCAGCTTCCCGAAAAATATAAAGAGATCTTTTTTGATCAATCGAAGAATGGTTATGCCGGATACATTGATGGCGGAGCCAGTCAGGAAGAATTTTACAAATTCATCAAACCGATCCTGGAAAAAATGGATGGCACAGAAGAACTGCTTGTGAAATTGAACCGGGAAGATTTACTGCGCAAACAGCGTACGTTCGACAACGGCTCCATACCCCATCAGATTCACTTAGGTGAGCTGCATGCAATACTCCGTCGCCAGGAAGATTTTTATCCATTTTTAAAAGACAACCGTGAGAAGATTGAAAAAATTTTAACTTTTCGTATTCCATATTACGTCGGGCCTTTGGCCCGAGGTAACTCTCGATTCGCCTGGATGACGAGAAAAAGCGAGGAGACCATCACTCCGTGGAATTTTGAAGAGGTTGTTGATAAAGGCGCGAGCGCCCAGTCGTTTATCGAACGTATGACCAACTTTGATAAAAATCTGCCGAATGAAAAAGTGCTTCCGAAGCATTCTCTGTTGTATGAATATTTCACTGTGTACAATGAGTTAACGAAAGTGAAATATGTGACCGAAGGCATGCGGAAACCTGCTTTTCTGTCCGGAGAACAGAAAAAAGCAATTGTGGACCTGCTGTTCAAAACGAACCGGAAAGTAACTGTGAAGCAGCTGAAAGAGGACTACTTCAAAAAAATCGAATGCTTCGACTCAGTAGAGATCTCTGGTGTTGAAGATCGCTTCAACGCGAGTCTGGGAACGTACCATGATTTGTTGAAAATCATCAAAGATAAAGACTTTCTGGATAACGAAGAGAATGAGGACATTCTTGAAGATATTGTTTTGACACTGACTCTGTTTGAGGATCGCGAAATGATTGAAGAGCGCCTGAAAACGTATGCCCATTTATTCGATGACAAAGTC
ATGAAGCAGCTGAAACGTCGCCGCTATACTGGGTGGGGCAGACTTTCACGTAAATTGATCAATGGTATAAGAGACAAACAGAGCGGCAAAACTATCTTAGATTTCCTGAAGAGTGATGGATTTGCCAACCGGAATTTTATGCAGCTTATACATGATGACTCGCTAACGTTTAAAGAAGACATTCAGAAGGCGCAGGTCAGCGGCCAGGGTGATTCGCTGCATGAACACATTGCAAATCTTGCCGGATCGCCAGCGATCAAAAAAGGCATCCTTCAGACAGTAAAAGTTGTGGATGAACTGGTGAAAGTAATGGGTCGTCACAAGCCAGAAAATATTGTGATCGAAATGGCCCGGGAAAATCAGACTACTCAAAAAGGTCAGAAAAATTCTCGCGAGCGTATGAAACGTATTGAAGAAGGCATCAAAGAGCTAGGCAGCCAGATATTAAAGGAACATCCGGTTGAGAACACTCAGCTGCAGAATGAAAAACTGTATCTGTATTATCTTCAGAACGGCCGTGACATGTATGTTGATCAAGAACTGGATATCAATCGCTTGTCCGATTATGACGTGGATCATATTGTTCCGCAAAGCTTTCTGAAAGACGATTCTATTGACAATAAAGTACTGACACGTTCGGACAAAAACCGTGGTAAAAGCGATAACGTACCGTCGGAAGAAGTTGTTAAGAAAATGAAAAATTATTGGCGCCAACTCCTGAATGCTAAATTGATTACCCAGCGGAAATTTGATAACTTAACCAAAGCCGAGCGGGGTGGCTTAAGTGAACTGGATAAAGCGGGTTTTATTAAACGCCAACTGGTAGAAACCCGCCAGATAACGAAACATGTAGCTCAAATCCTCGATAGTCGCATGAATACGAAATATGACGAAAATGATAAATTGATCCGTGAAGTAAAAGTGATTACTCTTAAAAGCAAATTGGTATCTGATTTTCGGAAAGATTTCCAATTCTATAAGGTGAGAGAAATTAACAATTACCATCATGCACATGATGCGTATTTAAATGCAGTTGTTGGCACCGCCTTAATCAAAAAATATCCGAAATTAGAATCTGAGTTCGTGTATGGTGATTATAAAGTTTATGATGTTCGAAAAATGATTGCTAAGTCTGAACAGGAAATCGGCAAAGCGACCGCAAAGTATTTTTTTTATAGCAATATTATGAATTTTTTTAAAACTGAGATTACCCTGGCGAATGGCGAAATTCGCAAACGTCCTCTGATTGAAACCAATGGCGAAACCGGCGAGATAGTATGGGACAAGGGCCGTGATTTTGCGACCGTCCGGAAAGTCCTGTCAATGCCGCAGGTGAATATTGTCAAGAAAACAGAAGTTCAGACAGGCGGTTTTAGTAAAGAGTCTATTCTGCCCAAACGTAATTCGGATAAATTGATTGCCCGCAAGAAAGATTGGGATCCGAAGAAATATGGTGGATTCGATTCTCCGACGGTCGCCTATAGCGTTCTAGTCGTCGCCAAGGTCGAAAAAGGTAAATCCAAAAAACTGAAATCTGTGAAAGAACTGTTAGGCATTACAATCATGGAACGTAGTAGTTTTGAAAAGAACCCGATCGACTTCCTCGAGGCGAAAGGCTACAAAGAAGTCAAGAAGGATTTGATTATTAAACTCCCAAAATATTCATTATTTGAGTTAGAAAACGGTAGGAAGCGTATGCTGGCGAGTGCTGGGGAATTACAGAAAGGGAATGAGTTAGCACTGCCGTCAAAATATGTGAACTTTCTGTATCTGGCCTCCCATTACGAGAAACTGAAAGGTAGCCCGGAAGATAATGAACAGAAACAACTATTTGTCGAGCAACACAAACATTATCTGGATGAAATTATTGAACAGATTAGTGAATTCTCTAAACGTGTTATTTTAGCGGATGCCAACCTTGACAAGGTGCTGAGCGCATATAATAAACACCGTGATAAACCCATTCGTGAACAGGCTGAAAATATCATACATCTGTTCAC
GTTAACCAACTTGGGAGCTCCTGCCGCTTTTAAATATTTCGATACCACAATTGACCGCAAACGTTATACGTCTACAAAAGAGGTGCTCGATGCGACCCTGATCCACCAGTCTATTACAGGCCTGTATGAAACTCGTATCGACCTGTCACAACTGGGCGGCGACTGA
SEQ ID NO: 169
ATGGACAAGAAATATTCAATCGGTTTAGCAATAGGAACTAACTCAGTAGGTTGGGCTGTAATTACAGACGAATACAAGGTACCGTCCAAAAAGTTTAAGGTGTTGGGGAACACAGATAGACACTCTATAAAAAAAAATTTAATAGGCGCTTTACTTTTCGATTCAGGCGAAACTGCAGAAGCGACACGTCTGAAGAGAACCGCTAGACGTAGATACACGAGGAGAAAGAACAGAATATGTTACCTACAAGAAATTTTTTCTAATGAGATGGCTAAGGTGGATGATTCGTTTTTTCATAGACTCGAAGAATCTTTCTTAGTTGAAGAAGATAAAAAACACGAAAGGCATCCTATCTTTGGAAACATAGTTGATGAGGTGGCTTACCATGAAAAATATCCCACTATATATCACCTTAGAAAAAAGTTGGTTGATTCAACCGACAAAGCGGATCTAAGGTTAATTTACCTCGCGTTGGCTCACATGATAAAATTTAGAGGACATTTCTTGATCGAAGGTGATTTAAATCCCGATAACTCTGATGTAGATAAACTGTTCATCCAGTTGGTTCAAACATATAATCAGTTGTTCGAAGAGAACCCCATTAACGCATCAGGTGTTGATGCTAAAGCAATCTTATCAGCAAGGTTGAGCAAGAGCAGACGTCTGGAAAACTTGATTGCCCAATTGCCAGGTGAAAAGAAGAACGGTCTTTTTGGAAATTTAATTGCACTTTCACTTGGGTTGACACCGAATTTTAAAAGCAATTTCGACCTCGCTGAGGATGCTAAACTCCAGTTATCTAAGGATACATATGACGATGATTTGGATAATCTATTGGCCCAGATAGGTGATCAGTATGCAGATTTGTTTTTGGCAGCTAAGAATTTATCAGATGCAATTCTACTGAGCGATATTTTAAGGGTGAATACAGAAATAACTAAAGCACCTTTGTCTGCATCTATGATAAAAAGATACGATGAACACCATCAAGATCTCACACTATTAAAAGCTTTAGTTAGACAACAATTACCAGAAAAATATAAAGAAATCTTTTTCGATCAGTCCAAGAACGGATACGCCGGCTATATAGATGGCGGTGCCTCCCAAGAAGAATTTTACAAATTTATCAAACCCATTTTGGAAAAGATGGATGGTACTGAAGAATTATTGGTCAAATTAAACAGGGAAGATTTATTAAGAAAACAAAGGACCTTTGATAATGGTTCTATTCCACACCAAATCCATCTAGGGGAATTACATGCGATTCTTAGAAGACAAGAAGATTTTTATCCATTCTTGAAAGATAACAGGGAAAAGATAGAGAAAATCTTAACTTTTAGAATTCCCTACTACGTCGGGCCCTTAGCTAGGGGGAATTCTAGATTCGCCTGGATGACACGCAAATCAGAAGAAACAATTACGCCTTGGAATTTTGAAGAAGTTGTTGATAAAGGAGCCTCTGCTCAATCTTTTATTGAACGAATGACCAATTTTGATAAGAATTTACCCAATGAAAAGGTCTTACCCAAACATTCACTCCTATACGAGTACTTTACTGTTTACAATGAGTTGACAAAAGTGAAGTATGTTACCGAGGGTATGCGAAAACCTGCTTTCTTGAGTGGTGAACAAAAGAAGGCCATTGTTGACTTGTTATTCAAAACTAACAGAAAGGTCACTGTGAAGCAGCTTAAAGAAGATTATTTCAAAAAGATCGAATGTTTCGACTCGGTAGAAATTAGTGGTGTGGAAGATAGATTTAATGCTTCTCTTGGAACATATCATGATCTACTAAAGATCATCAAAGATAAAGATTTCTT
GGACAATGAAGAAAATGAAGATATTCTTGAAGACATCGTGTTGACACTTACATTGTTTGAGGACAGAGAAATGATTGAAGAAAGGCTGAAGACCTACGCCCATTTGTTTGATGATAAAGTCATGAAACAGTTAAAGAGGAGAAGGTATACCGGATGGGGTAGGCTGTCTCGCAAATTGATTAATGGTATTCGTGATAAACAATCGGGTAAAACAATCCTAGATTTCCTGAAGTCCGATGGTTTCGCCAACAGGAATTTTATGCAATTGATTCATGACGATTCTTTGACTTTTAAAGAGGATATTCAGAAAGCACAGGTCTCAGGACAGGGCGATTCACTCCATGAACATATAGCTAACCTGGCTGGCTCCCCTGCTATTAAGAAAGGTATCTTGCAAACCGTCAAAGTAGTAGACGAACTTGTTAAAGTTATGGGAAGACACAAACCTGAAAATATCGTTATTGAAATGGCTCGCGAAAACCAGACAACACAAAAGGGTCAAAAGAATTCGAGAGAGAGAATGAAGCGTATCGAAGAAGGTATTAAAGAACTTGGGTCCCAAATACTTAAAGAACATCCAGTAGAAAACACTCAGCTTCAAAATGAAAAATTATACTTATATTATCTTCAGAATGGCCGCGATATGTATGTTGACCAAGAGTTAGATATAAATAGGTTGTCTGATTACGACGTGGATCATATTGTACCTCAATCTTTTCTAAAAGATGATTCAATTGATAATAAGGTATTAACGAGAAGTGATAAAAATAGAGGTAAATCTGACAACGTGCCAAGCGAAGAGGTGGTGAAGAAAATGAAAAATTATTGGCGTCAACTGTTGAACGCCAAGTTAATTACGCAGAGAAAGTTTGATAATCTAACAAAAGCTGAAAGAGGAGGCCTATCTGAGTTAGATAAGGCCGGTTTTATCAAACGTCAGTTAGTTGAAACCAGGCAAATCACGAAGCACGTTGCCCAAATTCTAGATTCAAGGATGAATACCAAATACGATGAAAACGATAAACTGATTCGGGAAGTCAAGGTTATAACTCTAAAAAGCAAACTAGTTTCAGATTTTCGCAAAGATTTTCAATTTTACAAAGTTCGAGAAATCAATAATTATCATCATGCTCACGACGCGTACTTGAACGCGGTCGTTGGTACAGCTTTAATAAAGAAATATCCTAAACTGGAATCGGAATTTGTATATGGGGATTACAAAGTATACGACGTGAGAAAGATGATCGCTAAATCTGAACAAGAAATTGGGAAAGCAACTGCCAAATATTTTTTTTACAGCAACATAATGAATTTTTTTAAAACGGAAATTACATTGGCAAATGGCGAAATTAGAAAGCGCCCATTGATAGAGACCAATGGAGAGACTGGGGAAATCGTGTGGGATAAAGGACGTGATTTTGCCACAGTGAGGAAAGTGTTAAGTATGCCACAAGTTAATATTGTAAAAAAGACCGAGGTCCAAACGGGTGGATTTAGCAAAGAATCAATTTTACCTAAGAGAAATTCAGATAAATTAATTGCCCGCAAAAAGGATTGGGATCCTAAAAAATATGGTGGTTTTGATTCCCCAACAGTTGCTTACTCCGTCCTAGTTGTTGCTAAGGTTGAAAAAGGAAAGTCTAAGAAACTTAAATCCGTAAAAGAGTTACTGGGAATTACAATAATGGAAAGATCCTCTTTCGAAAAGAACCCTATTGACTTCTTGGAGGCGAAAGGTTATAAAGAAGTCAAAAAAGATTTGATCATAAAACTACCAAAGTATTCTCTATTTGAATTGGAAAACGGCAGAAAAAGGATGTTGGCAAGCGCTGGTGAACTACAAAAGGGTAACGAATTGGCATTGCCGAGTAAATACGTGAATTTTCTATATTTGGCATCACATTACGAAAAGTTAAAGGGATCACCCGAGGATAACGAGCAGAAACAACTGTTTGTTGAACAACACAAACATTATCTTGATGAAATTATAGAACAAATTA
GTGAGTTCAGTAAGAGAGTTATTTTAGCCGATGCAAATTTAGACAAAGTTTTATCTGCTTATAACAAACATAGAGATAAGCCTATAAGGGAACAAGCCGAAAATATTATTCATTTGTTTACGTTAACAAATTTAGGGGCACCAGCAGCATTCAAGTACTTCGATACGACTATCGATCGTAAGCGTTACACATCTACCAAAGAAGTTCTTGATGCAACTTTGATTCATCAATCTATAACAGGCTTATATGAAACTAGAATCGATCTGTCACAACTTGGTGGTGACTAA
SEQ ID NO: 170
ATGGACAAGAAGTACTCAATTGGGCTTGCTATCGGCACTAACAGCGTTGGCTGGGCGGTCATCACAGACGAATATAAGGTCCCATCAAAGAAATTCAAAGTCCTTGGCAATACGGACCGACATTCAATCAAGAAGAACCTGATTGGAGCTCTGCTGTTTGATTCCGGTGAAACCGCCGAGGCAACACGATTGAAACGTACCGCTCGTAGGAGGTATACGCGGCGGAAAAATAGGATCTGCTATCTGCAGGAAATATTTAGCAACGAAATGGCCAAGGTAGACGACAGCTTCTTCCACCGGCTCGAGGAATCTTTCCTCGTGGAAGAAGACAAAAAGCACGAGCGCCACCCCATTTTCGGCAATATCGTGGACGAGGTAGCTTACCATGAAAAGTATCCAACTATTTACCACTTACGTAAGAAGTTAGTGGACAGCACCGATAAAGCCGACCTTCGCCTGATTTACCTAGCACTTGCACACATGATTAAGTTCCGAGGCCACTTCTTGATAGAGGGAGACCTGAATCCTGACAATTCCGATGTGGATAAATTGTTCATCCAGCTGGTACAGACATACAATCAGTTGTTTGAGGAAAATCCGATTAATGCCAGTGGCGTGGACGCCAAGGCTATCCTGTCTGCTCGGCTTAGTAAGAGTAGACGCCTGGAAAATCTAATCGCACAGCTGCCCGGCGAAAAGAAAAATGGACTGTTCGGTAATTTGATCGCCCTGAGCCTGGGCCTCACCCCTAACTTTAAGTCTAACTTCGACCTGGCCGAAGATGCTAAGCTCCAGCTGTCCAAAGATACTTACGATGACGATCTCGATAATCTACTGGCTCAGATCGGGGACCAGTACGCTGACCTGTTTCTAGCTGCCAAGAACCTCAGTGACGCCATTCTCCTGTCCGATATTCTGAGGGTTAACACTGAAATTACAAAGGCCCCGCTGAGCGCGAGCATGATCAAAAGGTACGACGAGCATCACCAGGACCTCACGCTGCTGAAGGCCTTAGTCAGACAGCAACTGCCCGAAAAGTACAAAGAAATCTTTTTCGACCAATCCAAGAACGGGTACGCCGGCTACATTGATGGCGGGGCTTCACAAGAGGAGTTTTACAAGTTTATCAAGCCCATCCTGGAGAAAATGGACGGCACTGAAGAACTGCTTGTGAAACTCAATAGGGAAGACTTACTGAGGAAACAGCGCACATTCGATAATGGCTCCATACCCCACCAAATCCATCTGGGAGAGTTGCATGCCATCTTGCGAAGGCAGGAGGACTTCTACCCCTTTCTTAAGGACAACAGGGAGAAAATCGAGAAAATTCTGACTTTCCGTATCCCCTACTACGTGGGCCCACTTGCTCGCGGAAACTCACGATTCGCATGGATGACCAGAAAGTCCGAGGAAACAATTACACCCTGGAATTTTGAGGAGGTAGTAGACAAGGGAGCCAGCGCTCAATCTTTCATTGAGAGGATGACGAATTTCGACAAGAACCTTCCAAACGAGAAAGTGCTTCCTAAGCACAGCCTGCTGTATGAGTATTTCACGGTGTACAACGAACTTACGAAGGTCAAGTATGTGACAGAGGGTATGCGGAAACCTGCTTTTCTGTCTGGTGAACAGAAGAAAGCTATCGTCGATCTCCTGTTTAAAACCAACCGAAAGGTGACGGTGAAACAGTTGAAGGAGG
ATTACTTCAAGAAGATCGAGTGTTTTGATTCTGTTGAAATTTCTGGGGTCGAGGATAGATTCAACGCCAGCCTGGGCACCTACCATGATTTGCTGAAGATTATCAAGGATAAGGATTTTCTGGATAATGAGGAGAATGAAGACATTTTGGAGGATATAGTGCTGACCCTCACCCTGTTCGAGGACCGGGAGATGATCGAGGAGAGACTGAAAACATACGCTCACCTGTTTGACGACAAGGTCATGAAGCAGCTTAAGAGACGCCGTTACACAGGCTGGGGAAGATTATCCCGCAAATTAATCAACGGGATACGCGATAAACAAAGTGGCAAGACCATACTCGACTTCCTAAAGAGCGATGGATTCGCAAATCGCAATTTCATGCAGTTGATCCACGACGATAGCCTGACCTTCAAAGAGGACATTCAGAAAGCGCAGGTGAGTGGTCAAGGGGATTCCCTGCACGAACACATTGCTAACTTGGCTGGATCACCAGCCATTAAGAAAGGCATACTGCAGACCGTTAAAGTGGTAGATGAGCTTGTGAAAGTCATGGGAAGACATAAGCCAGAGAACATAGTGATCGAAATGGCCAGGGAAAATCAGACCACGCAAAAGGGGCAGAAGAACTCAAGAGAGCGTATGAAGAGGATCGAGGAGGGCATCAAGGAGCTGGGTAGCCAGATCCTTAAAGAGCACCCAGTTGAGAATACCCAGCTGCAGAATGAGAAACTTTATCTCTATTATCTCCAGAACGGAAGGGATATGTATGTCGACCAGGAACTGGACATCAATCGGCTGAGTGATTATGACGTCGACCACATTGTGCCTCAAAGCTTTCTGAAGGATGATTCCATCGACAATAAAGTTCTGACCCGGTCTGATAAAAATAGAGGCAAATCCGACAACGTACCTAGCGAAGAAGTCGTCAAAAAAATGAAGAACTATTGGAGGCAGTTGCTGAATGCCAAGCTGATTACACAACGCAAGTTTGACAATCTCACCAAGGCAGAAAGGGGGGGCCTGTCAGAACTCGACAAAGCAGGTTTCATTAAAAGGCAGCTAGTTGAAACTAGGCAGATTACTAAGCACGTGGCCCAGATCCTCGACTCACGGATGAATACAAAGTATGATGAGAATGATAAGCTAATCCGGGAGGTGAAGGTGATTACTCTGAAATCTAAGCTGGTGTCAGATTTCAGAAAAGACTTCCAGTTCTACAAAGTCAGAGAGATCAACAATTATCACCATGCCCACGATGCATATCTTAATGCAGTAGTGGGGACAGCTCTGATCAAAAAATATCCTAAACTGGAGTCTGAATTCGTTTATGGTGACTATAAAGTCTATGACGTCAGAAAAATGATCGCAAAGAGCGAGCAGGAGATAGGGAAGGCCACAGCAAAGTACTTCTTTTACAGTAATATCATGAACTTTTTCAAAACTGAGATTACATTGGCTAACGGCGAGATCCGCAAGCGGCCACTGATAGAGACTAACGGAGAGACAGGGGAGATTGTTTGGGATAAGGGCCGTGACTTCGCCACCGTTAGGAAAGTGCTGTCCATGCCCCAGGTGAACATTGTGAAGAAGACAGAAGTGCAGACGGGTGGGTTCTCAAAAGAGTCTATTCTGCCTAAGCGGAATAGTGACAAACTGATCGCACGTAAAAAGGACTGGGATCCAAAAAAGTACGGCGGATTCGACAGTCCTACCGTTGCATATTCCGTGCTTGTGGTCGCTAAGGTGGAGAAGGGAAAAAGCAAGAAACTGAAGTCAGTCAAAGAACTACTGGGCATAACGATCATGGAGCGCTCCAGTTTCGAAAAAAACCCAATCGATTTTCTTGAAGCCAAGGGATACAAGGAGGTAAAGAAAGACCTTATCATTAAGCTGCCTAAGTACAGTCTGTTCGAACTGGAGAATGGGAGGAAGCGCATGCTGGCATCAGCTGGAGAACTCCAAAAAGGGAACGAGTTGGCCCTCCCCTCAAAGTATGTC
AATTTTCTCTACCTGGCTTCTCACTACGAGAAGTTAAAGGGGTCTCCAGAGGATAATGAGCAGAAACAGCTGTTTGTGGAACAGCACAAGCACTATTTGGACGAAATCATCGAACAAATTTCCGAGTTCAGTAAGAGGGTGATTCTGGCCGACGCAAACCTTGACAAAGTTCTGTCCGCATACAATAAGCACAGAGACAAACCAATCCGCGAGCAAGCCGAGAATATAATTCACCTTTTCACTCTGACTAATCTGGGGGCCCCCGCAGCATTTAAATATTTCGATACAACAATCGACCGGAAGCGGTATACATCTACTAAGGAAGTCCTCGATGCGACACTGATCCACCAGTCAATTACAGGTTTATATGAAACAAGAATCGACCTGTCCCAGCTGGGCGGCGACTAG SEQAAAATTCcatGCAAAATGCTCCGGTTTCATGTCATCAAAATGATGACGTAATTAAGCATTGATAATTGAGAIDTCCCTCTCCCTGACAGGATGATTACATAAATAATAGTGACAAAAATAAATTATTTATTTATCCAGAAAATNO:GAATTGGAAAATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTcaaaCAGGTtgccgtcactgc171gtcttttactggctcttctcgctaaccaaaccggtaaccccgcttattaaaagcattctgtaacaaagcgggaccaaagccatgacaaaaacgcgtaacaaaagtgtctataatcacggcagaaaagtccacattgattatttgcacggcgtcacactttgctatgccatagcatttttatccataagattagcggatcctacctgacgctttttatcgcaactctctactgtttctccatacccgtttttttgggctagcaccgcctatctcgtgtgagataggcggagatacgaactttaagAAGGAGatataccATGGAACAGGAATATTATCTGGGCTTGGACATGGGCACCGGTTCCGTCGGCTGGGCTGTTACTGACAGTGAATATCACGTTCTAAGAAAGCATGGTAAGGCATTGTGGGGTGTAAGACTTTTCGAATCTGCTTCCACTGCTGAAGAGCGTAGAATGTTTAGAACGAGTCGACGTAGGCTAGACAGGCGCAATTGGAGAATCGAAATTTTACAAGAAATTTTTGCGGAAGAGATATCTAAGAAAGACCCAGGCTTTTTCCTGAGAATGAAGGAATCTAAGTATTACCCTGAGGATAAAAGAGATATAAATGGTAACTGTCCCGAATTGCCTTACGCATTATTTGTGGACGATGATTTTACCGATAAGGATTACCATAAAAAGTTCCCAACTATCTACCATTTACGCAAAATGTTAATGAATACAGAGGAAACCCCAGACATAAGACTAGTTTATCTGGCAATACACCATATGATGAAACATAGAGGCCATTTCTTACTTTCCGGGGATATCAACGAAATCAAAGAGTTTGGTACCACATTTAGTAAGTTACTGGAAAACATAAAGAATGAAGAATTGGATTGGAACTTAGAACTCGGAAAAGAAGAATACGCGGTTGTCGAATCTATCCTGAAGGATAATATGCTGAATAGGTCGACCAAAAAAACTAGGCTGATCAAAGCACTGAAAGCCAAATCTATCTGCGAAAAAGCTGTTTTAAATTTACTTGCTGGTGGCACTGTTAAGTTATCAGACATTTTTGGTTTGGAAGAATTGAACGAAACCGAGCGTCCAAAAATTAGTTTCGCTGATAATGGCTACGATGATTACATTGGTGAGGTGGAAAACGAGTTGGGCGAACAATTTTATATTATAGAGACAGCTAAGGCAGTCTATGACTGGGCTGTTTTAGTAGAAATCCTTGGTAAATACACATCTATCTCCGAAGCGAAAGTTGCTACTTACGAAAAGCACAAGTCCGATCTCCAGTTTTTGAAGAAAATTGTCAGGAAATATCTGACTAAGGAAGAATATAAAGATATTTTCGT
TAGTACCTCTGACAAACTGAAAAATTACTCCGCTTACATCGGGATGACCAAGATTAATGGCAAAAAAGTTGATCTGCAAAGCAAAAGGTGTTCGAAGGAAGAATTTTATGATTTCATTAAAAAGAATGTCTTAAAAAAATTAGAAGGTCAGCCAGAATACGAATATTTGAAAGAAGAACTGGAAAGAGAGACATTCTTACCAAAACAAGTCAACAGAGATAATGGGGTAATTCCATATCAAATTCACCTCTACGAATTAAAAAAAATTTTAGGCAATTTACGCGATAAAATTGACCTTATCAAAGAAAATGAGGATAAGCTGGTTCAACTCTTTGAATTCAGAATACCCTATTATGTGGGCCCACTGAACAAGATTGATGACGGCAAAGAAGGTAAATTCACATGGGCCGTCCGCAAATCCAATGAAAAAATTTACCCATGGAACTTTGAAAATGTAGTAGATATTGAAGCGTCTGCGGAGAAATTTATTCGAAGAATGACTAATAAATGCACTTACTTGATGGGAGAGGATGTTCTGCCTAAAGACAGCTTATTATACAGCAAGTACATGGTTCTAAACGAACTTAACAACGTTAAGTTGGACGGTGAGAAATTAAGTGTAGAATTGAAACAAAGATTGTATACTGACGTCTTCTGCAAGTACAGAAAAGTGACAGTTAAAAAAATTAAGAATTACTTGAAGTGCGAAGGTATAATTTCTGGAAACGTAGAGATTACTGGTATTGATGGTGATTTCAAAGCATCCCTAACAGCTTACCACGATTTCAAGGAAATCCTGACAGGAACTGAACTCGCAAAAAAAGATAAAGAAAACATTATTACTAATATTGTTCTTTTCGGTGATGACAAGAAATTGTTGAAGAAAAGACTGAATAGACTTTACCCCCAGATTACTCCCAATCAACTTAAGAAAATTTGTGCTTTGTCTTACACAGGATGGGGTCGTTTTTCAAAAAAGTTCTTAGAAGAGATTACCGCACCTGATCCAGAAACAGGCGAAGTATGGAATATAATTACCGCCTTATGGGAATCGAACAATAATCTTATGCAACTTCTGAGCAATGAATATCGTTTCATGGAAGAAGTTGAGACTTACAACATGGGCAAACAGACGAAGACTTTATCCTATGAAACTGTGGAAAATATGTATGTATCACCTTCTGTCAAGAGACAAATTTGGCAAACCTTAAAAATTGTCAAAGAATTAGAAAAGGTAATGAAGGAGTCTCCTAAACGTGTGTTTATTGAAATGGCTAGAGAAAAACAAGAGTCAAAAAGAACCGAGTCAAGAAAGAAGCAGTTAATCGATTTATATAAGGCTTGTAAAAACGAAGAGAAAGATTGGGTTAAAGAATTGGGGGACCAAGAGGAACAAAAACTACGGTCGGATAAGTTGTATTTATACTATACGCAAAAGGGACGATGTATGTATTCCGGCGAGGTAATAGAATTGAAGGATTTATGGGACAATACAAAATATGACATAGACCATATATATCCCCAATCAAAAACGATGGACGATAGCTTGAACAATAGAGTACTCGTGAAAAAAAAATATAATGCGACCAAATCTGATAAGTATCCTCTGAATGAAAATATCAGACATGAAAGAAAGGGGTTCTGGAAGTCCTTGTTAGATGGTGGGTTTATAAGCAAAGAAAAGTACGAGCGTCTAATAAGAAACACGGAGTTATCGCCAGAAGAACTCGCTGGTTTTATTGAGAGGCAAATCGTGGAAACGAGACAATCTACCAAAGCCGTTGCTGAGATCCTAAAGCAAGTTTTCCCAGAGTCGGAGATTGTCTATGTCAAAGCTGGCACAGTGAGCAGGTTTAGGAAAGACTTCGAACTATTAAAGGTAAGAGAAGTGAACGATTTACATCACGCAAAGGACGCTTACCTAAATATCGTTGTAGGTAACTCATATTATGTTAAATTTACCAAGAACGCCTCTTGGTTTATAAAGGAGAACCCAGGTA
GAACATATAACCTGAAAAAGATGTTCACCTCTGGTTGGAATATTGAGAGAAACGGAGAAGTCGCATGGGAAGTTGGTAAGAAAGGGACTATAGTGACAGTAAAGCAAATTATGAACAAAAATAATATCCTCGTTACAAGGCAGGTTCATGAAGCAAAGGGCGGCCTTTTTGACCAACAAATTATGAAGAAAGGGAAAGGTCAAATTGCAATAAAAGAAACCGATGAGAGACTAGCGTCAATAGAAAAGTATGGTGGCTATAATAAAGCTGCGGGTGCATACTTTATGCTTGTTGAATCAAAAGACAAGAAAGGTAAGACTATTAGAACTATAGAATTTATACCCCTGTACCTTAAAAACAAAATTGAATCGGATGAGTCAATCGCGTTAAATTTTCTAGAGAAAGGAAGGGGTTTAAAAGAACCAAAGATCCTGTTAAAAAAGATTAAGATTGACACCTTGTTCGATGTAGATGGATTTAAAATGTGGTTATCTGGCAGAACAGGCGATAGACTTTTGTTTAAGTGCGCTAATCAATTAATTTTGGATGAGAAAATCATTGTCACAATGAAAAAAATAGTTAAGTTTATTCAGAGAAGACAAGAAAACAGGGAGTTGAAATTATCTGATAAAGATGGTATCGACAATGAAGTTTTAATGGAAATCTACAATACATTCGTTGATAAACTTGAAAATACCGTATATCGAATCAGGTTAAGTGAACAAGCCAAAACATTAATTGATAAACAAAAAGAATTTGAAAGGCTATCACTGGAAGACAAATCCTCCACCCTATTTGAAATTTTGCATATATTCCAGTGCCAATCTTCAGCAGCTAATTTAAAAATGATTGGCGGACCTGGGAAAGCCGGCATCCTAGTGATGAACAATAATATCTCCAAGTGTAACAAAATATCAATTATTAACCAATCTCCGACAGGTATTTTTGAAAATGAAATAGACTTGCTTAAGATATAAGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATTTATTATATCGCGTTGATTATTGATGCTGTTTTTAGTTTTAACGGCAATTAATATATGTGTTATTAATTGAATGAATTTTATCATTCATAATAAGTATGTGTAGGATCAAGCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACAGAATTATCTCATAACAAGTGTTAAGGGATGTTATTTCC SEQ ID AATTCAAAGGATAATCAAAC NO:172 SEQ ID AATCTCTACTCTTTGTAGAT NO: 173 SEQ ID AATTTCTACTGTTGTAGAT NO:174 SEQ ID AATTTCTACTAGTGTAGAT NO: 175 SEQ ID AATTTCTACTATTGT NO: 176SEQ ID AATTTCTACTGTTGTAGA NO: 177 SEQ ID AATTTCTACTATTGTA NO: 178 SEQ IDAATTTCTACTTTTGTAGAT NO: 179 SEQ ID AATTTCTACTGTTGTAGAT NO: 180 SEQ IDAATTTCTACTCTTGTAGAT NO: 181

1. A nucleic acid-guided nuclease system comprising: (a) a nucleic acid-guided nuclease comprising the amino acid sequence of SEQ ID NO: 7; (b) a heterologous engineered guide nucleic acid for complexing with the nucleic acid-guided nuclease, wherein said heterologous engineered guide comprises a pseudoknot region comprising nucleotide residues 18-36 of SEQ ID NO: 200 or nucleotide residues 18-36 of SEQ ID NO: 201, and (c) an editing sequence having a change in sequence relative to the sequence of a target region; wherein the system results in an edit in the target region facilitated by the nuclease, the heterologous guide nucleic acid, and the editing sequence.
 2. (canceled)
 3. (canceled)
 4. The system of claim 1, wherein the target region is within a eukaryotic cell.
 5. The system of claim 1, wherein the target region is within a bacterial cell.
 6. The system of claim 1, wherein the target region is within a plant cell.
 7. The system of claim 1, wherein the target region is within a mammalian cell.
 8. The system of claim 1, wherein the heterologous engineered guide nucleic acid comprises any one of SEQ ID NO: 88, SEQ ID NO: 93, SEQ ID NO: 94, and SEQ ID NO: 95.
 9. The system of claim 1, wherein the heterologous engineered guide nucleic acid and the editing sequence are provided as a single nucleic acid.
 10. The system of claim 1, wherein the editing sequence further comprises a mutation in a protospacer adjacent motif (PAM) site.
 11. A method of modifying a target region, the method comprising: (a) contacting a sample comprising the target region with: (i) a nucleic acid-guided nuclease comprising the amino acid sequence of SEQ ID NO: 7; and (ii) a heterologous engineered guide nucleic acid for complexing with the nucleic acid-guided nuclease, wherein said heterologous engineered guide comprises a pseudoknot region having a sequence comprising nucleotide residues 18-36 of SEQ ID NO: 200 or nucleotide residues 18-36 of SEQ ID NO: 201; and (b) allowing the nuclease and heterologous guide nucleic acid to modify the target region.
 12. The method of claim 11, wherein the sample is further contacted with (iii) an editing sequence having a change in sequence relative to the sequence of the target region.
 13. The method of claim 12, wherein modifying the target results in an edit in the target region facilitated by the nuclease, the heterologous engineered guide nucleic acid, and the editing sequence.
 14. The method of claim 12, wherein the heterologous engineered guide nucleic acid and the editing sequence are provided as a single nucleic acid.
 15. The method of claim 12, wherein the editing sequence further comprises a mutation in a protospacer adjacent motif (PAM) site.
 16. The method of claim 11, wherein the target region is within a eukaryotic cell.
 17. The method of claim 11, wherein the target region is within a bacterial cell.
 18. The method of claim 11, wherein the target region is within a plant cell.
 19. The method of claim 11, wherein the target region is within a mammalian cell.
 20. The method of claim 11, wherein the heterologous engineered guide nucleic acid comprises any one of SEQ ID NO: 88, SEQ ID NO: 93, SEQ ID NO: 94, and SEQ ID NO: 95.
 21. A nucleic acid-guided nuclease system comprising: (a) a nucleic acid-guided nuclease that binds to a conserved pseudoknot region having a sequence comprising nucleotide residues 18-36 of SEQ ID NO: 200 or nucleotide residues 18-36 of SEQ ID NO: 201; (b) a heterologous engineered guide nucleic acid for complexing with the nucleic acid-guided nuclease, wherein said heterologous engineered guide comprises the conserved pseudoknot region having a sequence comprising nucleotide residues 18-36 of SEQ ID NO: 200 or nucleotide residues 18-36 of SEQ ID NO: 201, and (c) an editing sequence having a change in sequence relative to the sequence of a target region; wherein the system results in an edit in the target region facilitated by the nuclease, the heterologous engineered guide nucleic acid, and the editing sequence.